DNA sequences for human tumour suppressor genes

ABSTRACT

A method for the diagnosis of a predisposition thereto, in a patient, comprising the steps of: (1) establishing the level of expression of a gene selected from the group consisting of PR domain containing 7 protein (PRDM7), CDT1 DNA replication factor (CDT1), charged multivesicular body protein 1/chromoatin modifying protein 1 (CHMP1), BTG3 associated nucleoprotein (BANP), BNO224, BNO36, BNO34, BNO208, BNO230 and BNO44; and (2) comparing expression of the gene to a baseline established from expression in normal tissue controls; wherein substantial variance from the baseline indicates that the patient is susceptible to cancer.

TECHNICAL FIELD

[0001] The present invention is concerned with DNA sequences from the 16q24.3 region which encode functional domains indicative of a potential role in the tumourigenic process.

BACKGROUND ART

[0002] The development of human carcinomas has been shown to arise from the accumulation of genetic changes involving both positive regulators of cell function (oncogenes) and negative regulators (tumour suppressor genes). For a normal somatic cell to evolve into a metastatic tumour it requires changes at the cellular level, such as immortalisation, loss of contact inhibition and invasive growth capacity, and changes at the tissue level, such as evasion of host immune responses and growth restraints imposed by surrounding cells, and the formation of a blood supply for the growing tumour.

[0003] Molecular genetic studies of colorectal carcinoma have provided substantial evidence that the generation of malignancy requires the sequential accumulation of a number of genetic changes within the same epithelial stem cell of the colon. For a normal colonic epithelial cell to become a benign adenoma, progress to intermediate and late adenomas, and finally become a malignant cell, inactivating mutations in tumour suppressor genes and activating mutations in proto-oncogenes are required (Fearon and Vogelstein, 1990).

[0004] The employment of a number of techniques, such as loss of heterozygosity (LOH), comparative genomic hybridisation (CGH) and cytogenetic studies of cancerous tissue, all of which exploit chromosomal abnormalities associated with the affected cell, has aided in the identification of a number of tumour suppressor genes and oncogenes associated with a range of tumour types.

[0005] In one aspect, studies of cancers such as retinoblastoma and colon carcinoma have supported the model that LOH is a specific event in the pathogenesis of cancer and has provided a mechanism in which to identify the cancer causing genes. For instance in colorectal carcinoma, inherited forms of the disease have been mapped to the long arm of chromosome 5 while LOH at 5q has been reported in both the familial and sporadic versions of the disease. The APC tumour suppressor gene, mapping to this region, was subsequently shown to be involved (Groden et al., 1991). The model is further highlighted in Von Hippel-Lindau (VHL) syndrome, a rare disorder that predisposes individuals to a variety of tumours including clear cell carcinomas of the kidneys and islet cell tumours of the pancreas. Both sporadic and inherited cases of the syndrome show LOH for the short arm of chromosome 3 and somatic translocations involving 3p in sporadic tumours, and genetic linkage to the same region in affected families has also been observed. The VHL tumour suppressor gene has since been identified from this region of chromosome 3 and mutations in it have been detected in 100% of patients who carry a clinical diagnosis of VHL disease. In addition, the VHL gene is inactivated in approximately 50-80% of the more common sporadic form of renal clear cell carcinoma.

[0006] The genetic determinants involved in breast cancer are not as well defined as that of colon cancer due in part to the histological stages of breast cancer development being less well characterised. However, as with colon carcinoma, it is believed that a number of genes need to become involved in a stepwise progression during breast tumourigenesis.

[0007] Certain women appear to be at an increased risk of developing breast cancer. Genetic linkage analysis has shown that 5 to 10% of all breast cancers are due to at least two autosomal dominant susceptibility genes. Generally, women carrying a mutation in a susceptibility gene develop breast cancer at a younger age compared to the general population, often have bilateral breast tumours, and are at an increased risk of developing cancers in other organs, particularly carcinoma of the ovary.

[0008] Genetic linkage analysis on families showing a high incidence of early-onset breast cancer (before the age of 46) was successful in mapping the first susceptibility gene, BRCA1, to chromosome 17q21 (Hall et al., 1990). Subsequent to this, the BRCA2 gene was mapped to chromosome 13q12-q13 (Wooster et al., 1994) with this gene conferring a higher incidence of male breast cancer and a lower incidence of ovarian cancer when compared to BRCA1.

[0009] Both BRCA1 and BRCA2 have since been cloned (Miki et al., 1994; Wooster et al., 1995) and numerous mutations have been identified in these genes in susceptible individuals with familial cases of breast cancer.

[0010] Additional inherited breast cancer syndromes exist, however they are rare. Inherited mutations in the TP53 gene have been identified in individuals with Li-Fraumeni syndrome, a familial cancer resulting in epithelial neoplasms occurring at multiple sites including the breast. Similarly, germline mutations in the MMAC1/PTEN gene involved in Cowden's disease and the ataxia telangiectasia (AT) gene have been shown to confer an increased risk of developing breast cancer, among other clinical manifestations, but together account for only a small percentage of families with an inherited predisposition to breast cancer.

[0011] Somatic mutations in the TP53 gene have been shown to occur in a high percentage of individuals with sporadic breast cancer. However, although LOH has been observed at the BRCA1 and BRCA2 loci at a frequency of 30 to 40% in sporadic cases (Cleton-Jansen et al., 1995; Saito et al., 1993), there is virtually no sign of somatic mutations in the retained allele of these two genes in sporadic cancers (Futreal et al., 1994; Miki et al., 1996). Recent data suggests that DNA methylation of the promoter sequence of these genes may be an important mechanism of down-regulation. The use of both restriction fragment length polymorphisms and small tandem repeat polymorphic markers has identified numerous regions of allelic imbalance in breast cancer suggesting the presence of additional tumour suppressor genes, which may be implicated in breast cancer. Data compiled from more than 30 studies reveals the loss of DNA from at least 11 chromosome arms at a frequency of more than 25%, with regions such as 16q and 17p affected in more than 50% of tumours (Devilee and Cornelisse, 1994; Brenner and Aldaz, 1995). However only some of these regions are known to harbour tumour suppressor genes shown to be mutated in individuals with both sporadic (TP53 and RB genes) and familial (TP53, RB, BRCA1, and BRCA2 genes) forms of breast cancer.

[0012] Cytogenetic studies have implicated loss of the long arm of chromosome 16 as an early event in breast carcinogenesis since it is found in tumours with few or no other cytogenetic abnormalities. Alterations in chromosome 1 and 16 have also been seen in several cases of ductal carcinoma in situ (DCIS), the preinvasive stage of ductal breast carcinoma. In addition, LOH studies on DCIS samples identified loss of 16q markers in 29 to 89% of the cases tested (Chen et al., 1996; Radford et al., 1995). In addition, examination of tumours from other tissue types have indicated that 16q LOH is also frequently seen in prostate, lung, hepatocellular, ovarian, primitive neuroectodermal and Wilms' tumours.

[0013] Together, these findings suggest the presence of a tumour suppressor gene mapping to the long arm of chromosome 16 that is critically involved in the early development of a large proportion of breast cancers as well as cancers from other tissue types, but to date no such gene has been identified.

DISCLOSURE OF THE INVENTION

[0014] The present invention provides nucleic acid and protein sequences that represent tumour suppressor genes involved in breast cancer, herein termed “tumour suppressor sequences”. The tumour suppressor sequences of the invention have been identified from a region of LOH seen in breast cancer, as well as other carcinomas including prostate tumours. Combined with the knowledge that these tumour suppressor sequences confer functional properties potentially linked with cancer to the proteins with which they encode suggests they are tumour suppressor genes playing a contributory role in cancer. The tumour suppressor sequences of the invention are described in Table 1 and are represented by SEQ ID NO:1 to 26.

[0015] The present invention also encompasses isolated nucleic acid and/or amino acid sequences which are homologous to the tumour suppressor sequences described above. Such homology is based on the overall nucleic acid or amino acid sequence of the group described in Table 1 and represented by the SEQ ID NO:1 to 26 and is determined using either homology programs or hybridisation conditions as outlined below.

[0016] A nucleic acid or protein is a tumour suppressor nucleic acid or protein if the overall homology of the nucleic acid or protein sequence to one of the sequences described in Table 1 and represented by the SEQ ID NO:1 to 26 is at least 70%, preferably 85% and most preferably 95%. Homology in this context means sequence similarity or identity, with identity being preferred.

[0017] In a preferred embodiment, the sequences which are used to determine sequence identity or similarity are selected from the sequences described in Table 1 and represented by the SEQ ID NO:1 to 26 or are naturally occurring allelic variants, sequence variants or splice variants of these sequences.

[0018] Sequence identity is typically calculated using the BLAST algorithm, described in Altschul et al Nucleic Acids Res. 25, 3389-3402 (1997) with the BLOSUM62 default matrix.

[0019] In one embodiment, nucleic acid homology can be determined through hybridisation studies. Nucleic acids which hybridise under stringent conditions to the nucleic acids of the invention are considered breast cancer sequences. Under stringent conditions, hybridisation will most preferably occur at 42° C. in 750 mM NaCl, 75 mM trisodium citrate, 2% SDS, 50% formamide, 1× Denhart's, 10% (w/v) dextran sulphate and 100 μg/ml denatured salmon sperm DNA. Useful variations on these conditions will be readily apparent to those skilled in the art. The washing steps which follow hybridization most preferably occur at 65° C. in 15 mM NaCl, 1.5 mM trisodium citrate, and 1% SDS. Additional variations on these conditions will be readily apparent to those skilled in the art.

[0020] In a further aspect, the invention provides tumour suppressor sequences, as described in Table 1 and represented by the SEQ ID NO:1 to 26, or the nucleotide sequence of a nucleic acid which hybridises thereto as described above, and appropriate control elements of the tumour suppressor sequences.

[0021] Preferably the control elements are those which mediate expression in breast tissue, but may also mediate expression in other tissues including, but not restricted to, prostate, liver and ovary. The tumour suppressor nucleic acid sequences of the present invention can be engineered using methods accepted in the art so as to alter the sequences for a variety of purposes. These include, but are not limited to, modification of the cloning, processing, and/or expression of the gene product. PCR reassembly of gene fragments and the use of synthetic oligonucleotides allow the engineering of breast cancer sequences of the invention. For example, oligonucleotide-mediated site-directed mutagenesis can introduce mutations that create new restriction sites, alter glycosylation patterns and produce splice variants etc.

[0022] As a result of the degeneracy of the genetic code, a number of polynucleotide sequences encoding tumour suppressor proteins of the invention, some that may have minimal similarity to the polynucleotide sequences of any known and naturally occurring gene, may be produced. Thus, the invention includes each and every possible variation of polynucleotide sequence that could be made by selecting combinations based on possible codon choices. These combinations are made in accordance with the standard triplet genetic code as applied to the polynucleotide sequence of naturally occurring breast cancer sequences, and all such variations are to be considered as being specifically disclosed.

[0023] The polynucleotides of this invention include RNA, cDNA, genomic DNA, synthetic forms, and mixed polymers, both sense and antisense strands, and may be chemically or biochemically modified, or may contain non-natural or derivatised nucleotide bases as will be appreciated by those skilled in the art. Such modifications include labels, methylation, intercalators, alkylators and modified linkages. In some instances it may be advantageous to produce nucleotide sequences encoding tumour suppressor sequences of the invention, or their derivatives, possessing a substantially different codon usage than that of the naturally occurring gene. For example, codons may be selected to increase the rate of expression of the peptide in a particular prokaryotic or eukaryotic host corresponding with the frequency that particular codons are utilized by the host. Other reasons to alter the nucleotide sequence encoding tumour suppressor sequences of the invention, or their derivatives, without altering the encoded amino acid sequences include the production of RNA transcripts having more desirable properties, such as a greater half-life, than transcripts produced from the naturally occurring sequence.

[0024] The invention also encompasses production of tumour suppressor sequences of the invention entirely by synthetic chemistry. Synthetic sequences may be inserted into expression vectors and cell systems that contain the necessary elements for transcriptional and translational control of the inserted coding sequence in a suitable host. Numerous types of appropriate expression vectors and suitable regulatory elements are known in the art for a variety of host cells. Regulatory elements may include regulatory, sequences, promoter sequences, ribosomal binding sites, transcriptional start and stop sequences, 5′ and 3′ untranslated regions and specific translational start and stop signals (such as an ATG initiation codon and Kozak consensus sequence). Regulatory elements will allow more efficient translation of sequences encoding breast cancer genes of the invention. In cases where the complete tumour suppressor coding sequence including its initiation codon and upstream regulatory sequences are inserted into the appropriate expression vector, additional control signals may not be needed. However, in cases where only coding sequence, or a fragment thereof, is inserted, exogenous translational control signals as described above should be provided by the vector. Such signals may be of various origins, both natural and synthetic. The efficiency of expression may be enhanced by the inclusion of enhancers appropriate for the particular host cell system used (Scharf et al., 1994).

[0025] The present invention allows for the preparation of purified tumour suppressor polypeptide or protein, from the polynucleotides of the present invention or variants thereof. In order to do this, host cells may be transfected with a nucleic acid molecule as described above. Typically said host cells are transfected with an expression vector comprising a nucleic acid encoding a tumour suppressor protein according to the invention. Cells are cultured under the appropriate conditions to induce or cause expression of the tumour suppressor protein. The conditions appropriate for protein expression will vary with the choice of the expression vector and the host cell, and will be easily ascertained by one skilled in the art.

[0026] A variety of expression vector/host systems may be utilized to contain and express the tumour suppressor sequences of the invention and are well known in the art. These include, but are not limited to, microorganisms such as bacteria transformed with plasmid or cosmid DNA expression vectors; yeast transformed with yeast expression vectors; insect cell systems infected with viral expression vectors (e.g., baculovirus); or mouse or other animal or human tissue cell systems. In a preferred embodiment the tumour suppressor proteins of the invention are expressed in mammalian cells using various expression vectors including plasmid, cosmid and viral systems such as adenoviral, retroviral or vaccinia virus expression systems. The invention is not limited by the host cell employed.

[0027] The polynucleotide sequences, or variants thereof, of the present invention can be stably expressed in cell lines to allow long-term production of recombinant proteins in mammalian systems. These sequences can be transformed into cell lines using expression vectors which may contain viral origins of replication and/or endogenous expression elements and a selectable marker gene on the same or on a separate vector. The selectable marker confers resistance to a selective agent, and its presence allows growth and recovery of cells which successfully express the introduced sequences. Resistant clones of stably transformed cells may be propagated using tissue culture techniques appropriate to the cell type.

[0028] The protein produced by a transformed cell may be secreted or retained intracellularly depending on the sequence and/or the vector used. As will be understood by those of skill in the art, expression vectors containing polynucleotides which encode a protein of the invention may be designed to contain signal sequences which direct secretion of the protein through a prokaryotic or eukaryotic cell membrane.

[0029] In addition, a host cell strain may be chosen for its ability to modulate expression of the inserted sequences or to process the expressed protein in the desired fashion. Such modifications of the polypeptide include, but are not limited to, acetylation, glycosylation, phosphorylation, and acylation. Post-translational cleavage of a “prepro” form of the protein may also be used to specify protein targeting, folding, and/or activity. Different host cells having specific cellular machinery and characteristic mechanisms for post-translational activities (e.g., CHO or HeLa cells), are available from the American Type Culture Collection (ATCC) and may be chosen to ensure the correct modification and processing of the foreign protein.

[0030] When large quantities of protein are needed such as for antibody production, vectors which direct high levels of expression of the breast cancer sequences may be used such as those containing the T5 or T7 inducible bacteriophage promoter. The present invention also includes the use of the expression systems described above in generating and isolating fusion proteins which contain the important functional domains of the protein. These fusion proteins are used for binding, structural and functional studies as well as for the generation of appropriate antibodies.

[0031] In order to express and purify the protein as a fusion protein, the appropriate cDNA sequence is inserted into a vector which contains a nucleotide sequence encoding another peptide (for example, glutathionine succinyl transferase). The fusion protein is expressed and recovered from prokaryotic or eukaryotic cells. The fusion protein can then be purified by affinity chromatography based upon the fusion vector sequence. The relevant protein can subsequently be obtained by enzymatic cleavage of the fusion protein.

[0032] In one embodiment, a fusion protein may be generated by the fusion of a tumour suppressor polypeptide with a tag polypeptide which provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is generally placed at the amino- or carboxy-terminus of the tumour suppressor polypeptide. The presence of such epitope-tagged forms of a tumour suppressor polypeptide can be detected using an antibody against the tag polypeptide. Also, provision of the epitope tag enables the tumour suppressor polypeptide to be readily purified by affinity purification using an anti-tag antibody or another type of affinity matrix that binds to the epitope tag.

[0033] Various tag polypeptides and their respective antibodies are well known in the art. Examples include poly-histidine or poly-histidine-glycine tags and the c-myc tag and antibodies thereto.

[0034] Fragments of tumour suppressor polypeptide may also be produced by direct peptide synthesis using solid-phase; techniques. Automated synthesis may be achieved by using the ABI 431A Peptide Synthesizer (Perkin-Elmer). Various fragments of breast cancer polypeptide may be synthesized separately and then combined to produce the full-length molecule.

[0035] In a further aspect of the invention there is provided a method of preparing a polypeptide as described above, comprising the steps of:

[0036] (1) culturing the host cells under conditions effective for production of the polypeptide; and

[0037] (2) harvesting the polypeptide.

[0038] Substantially purified tumour suppressor proteins or fragments thereof can then be used in further biochemical analyses to establish secondary and tertiary structure for example by x-ray crystallography of the protein or by nuclear magnetic resonance (NMR). Determination of structure allows for the rational design of pharmaceuticals to interact with the protein, alter protein charge configuration or charge interaction with other proteins, or to alter its function in the cell.

[0039] According to the invention there is provided a method for the diagnosis of a predisposition to cancer, in a patient, comprising the steps of:

[0040] (1) establishing the level of expression of a gene selected from the group consisting of PR domain containing 7 protein (PRDM7), CDT1 DNA replication factor (CDT1), charged multivesicular body protein 1/chromoatin modifying protein 1 (CHMP1), BTG3 associated nucleoprotein (BANP), BNO224, BNO36, BNO34, BNO208, BNO230 and BNO44; and

[0041] (2) comparing expression of the gene to a baseline established from expression in normal tissue controls;

[0042] wherein substantial variance from the baseline indicates that the patient is susceptible to cancer.

[0043] The tumour suppressor sequences of the present invention have been identified from a region of restricted LOH seen in breast cancer. In addition, these tumour suppressor sequences have been shown to confer functional properties potentially linked with cancer to the proteins with which they encode. As many of these genes are expressed in a wide variety of tissues and LOH of 16q has been found in cancers of other tissue types, including prostate, liver, ovary, primitive neuroectodermal and Wilms' tumours, suggests they may represent tumour suppressor genes involved in a variety of cancers. Such cancers may include, but are not limited to adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the breast, prostate, blood, germ cells, liver, ovary, adrenal gland, cervix, heart, brain, lung, placenta, skeletal muscle, synovial membrane, tonsil, lymph tissue, kidney, colon, uterus, skin and testis. Other cancers may include those of the head and neck, bladder, bone, bone marrow, gall bladder, ganglia, gastrointestinal tract, pancreas, parathyroid, penis, salivary glands, spleen, stomach, thymus and thyroid gland.

[0044] With the identification of the tumour suppressor nucleotide and protein sequences of the invention, probes and antibodies raised to the genes can be used in a variety of hybridisation and immunological assays to screen for and detect the presence of either a normal or mutated gene or gene product.

[0045] The nucleotide and protein sequences of the tumour suppressor genes provided in this invention also enable therapeutic methods for the treatment of cancers associated with one or more of these genes, and enable methods for the diagnosis or prognosis of all cancers associated with the these genes. Examples of such cancers include, but are not limited to, those listed above.

[0046] Due to their recessive nature, both copies of a tumour suppressor gene within a cell need to be inactivated for that cell to be affected. Therefore, in the treatment of cancers associated with inactivated or decreased tumour suppressor gene activity and/or expression, it is desirable to increase the activity and/or expression of the relevant gene.

[0047] Enhancing Tumour Suppressor Gene or Protein Function

[0048] Enhancing, stimulating or re-activating the function of those tumour suppressor genes or proteins that are mutated or down-regulated in cancer can be achieved in a variety of ways as would be appreciated by those skilled in the art.

[0049] In a preferred embodiment a tumour suppressor gene of the invention is administered to a subject to treat or prevent a cancer associated with decreased activity and/or expression of the gene.

[0050] In a further aspect, there is provided the use of a nucleic acid molecule of the invention, as described above, in the manufacture of a medicament for the treatment of a cancer associated with decreased activity and/or expression of the corresponding gene.

[0051] Typically, a vector capable of expressing a tumour suppressor gene of the invention, or fragment or derivative thereof, may be administered to a subject to treat or prevent a cancer associated with decreased activity and/or expression of the gene, including but not limited to, those described above.

[0052] Transducing retroviral vectors are often used for somatic cell gene therapy because of their high efficiency of infection and stable integration and expression. The full-length breast cancer gene, or portions thereof, can be cloned into a retroviral vector and expression can be driven from its endogenous promoter or from the retroviral long terminal repeat or from a promoter specific for the target cell type of interest. Other viral vectors can be used and include, as is known in the art, adenoviruses, adeno-associated virus, vaccinia virus, papovaviruses, lentiviruses and retroviruses of avian, murine and human origin.

[0053] Gene therapy would be carried out according to established methods (Friedman, 1991; Culver, 1996). A vector containing a copy of a tumour suppressor gene linked to expression control elements and capable of replicating inside the cells is prepared. Alternatively the vector may be replication deficient and may require helper cells or helper virus for replication and virus production and use in gene therapy.

[0054] Gene transfer using non-viral methods of infection can also be used. These methods include direct injection of DNA, uptake of naked DNA in the presence of calcium phosphate, electroporation, protoplast fusion or liposome delivery. Gene transfer can also be achieved by delivery as a part of a human artificial chromosome or receptor-mediated gene transfer. This involves linking the DNA to a targeting molecule that will bind to specific cell-surface receptors to induce endocytosis and transfer of the DNA into mammalian cells. One such technique uses poly-L-lysine to link asialoglycoprotein to DNA. An adenovirus is also added to the complex to disrupt the lysosomes and thus allow the DNA to avoid degradation and move to the nucleus. Infusion of these particles intravenously has resulted in gene transfer into hepatocytes.

[0055] In affected subjects that express a mutated form of a tumour suppressor gene of the invention, it may be possible to prevent the cancer by introducing into the affected cells a wild-type copy of the gene such that it recombines with the mutant gene. This requires a double recombination event for the correction of the gene mutation. Vectors for the introduction of genes in these ways are known in the art, and any suitable vector may be used. Alternatively, introducing another copy of the gene bearing a second mutation in that gene may be employed so as to negate the original gene mutation and block any negative effect.

[0056] In a still further aspect the invention provides a method for the treatment of a cancer associated with decreased activity and/or expression of a tumour suppressor gene of the invention, comprising administering a polypeptide as described above, or an agonist thereof, to a subject in need of such treatment.

[0057] In another aspect the invention provides the use of a polypeptide as described above, or an agonist thereof, in the manufacture of a medicament for the treatment of a cancer associated with decreased activity and/or expression of a tumour suppressor gene.

[0058] Diagnostic and Prognostic Applications

[0059] Polynucleotide sequences encoding the tumour suppressor genes of the invention may be used for the diagnosis or prognosis of cancers associated with their dysfunction, or a predisposition to such cancers. Examples of such cancers include, but are not limited to, adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the breast, prostate, blood, germ cells, liver, ovary, adrenal gland, cervix, heart, brain, lung, placenta, skeletal muscle, synovial membrane, tonsil, lymph tissue, kidney, colon, uterus, skin and testis. Other cancers may include those of the head and neck, bladder, bone, bone marrow, gall bladder, ganglia, gastrointestinal tract, pancreas, parathyroid, penis, salivary glands, spleen, stomach, thymus and thyroid gland.

[0060] Diagnosis or prognosis may be used to determine the severity, type or stage of the disease state in order to initiate an appropriate therapeutic intervention.

[0061] In another embodiment of the invention, the polynucleotides that may be used for diagnostic or prognostic purposes include oligonucleotide sequences, genomic DNA and complementary RNA and DNA molecules. The polynucleotides may be used to detect and quantitate gene expression in biopsied tissues in which mutations or abnormal expression of the relevant tumour suppressor gene may be correlated with disease. Genomic DNA used for the diagnosis or prognosis may be obtained from body cells, such as those present in the blood, tissue biopsy, surgical specimen, or autopsy material. The DNA may be isolated and used directly for detection of a specific sequence or may be amplified by the polymerase chain reaction (PCR) prior to analysis. Similarly, RNA or cDNA may also be used, with or without PCR amplification. To detect a specific nucleic acid sequence, direct nucleotide sequencing, reverse transcriptase PCR (RT-PCR), hybridization using specific oligonucleotides, restriction enzyme digest and mapping, PCR mapping, RNAse protection, and various other methods may be employed. Oligonucleotides specific to particular sequences can be chemically synthesized and labelled radioactively or non-radioactively and hybridised to individual samples immobilized on membranes or other solid-supports or in solution. The presence or absence of a particular target tumour suppressor may then be visualized using methods such as autoradiography, fluorometry, or colorimetry.

[0062] In a particular aspect, the nucleotide sequences encoding a tumour suppressor gene of the invention may be useful in assays that detect the presence of associated disorders, particularly those mentioned previously. The nucleotide sequences encoding the relevant tumour suppressor gene may be labelled by standard methods and added to a fluid or tissue sample from a patient under conditions suitable for the formation of hybridization complexes. After a suitable incubation period, the sample is washed and the signal is quantitated and compared with a standard value. If the amount of signal in the patient sample is significantly altered in comparison to a control sample then the presence of altered levels of nucleotide sequences encoding the tumour suppressor gene in the sample indicates the presence of the associated disorder. Such assays may also be used to evaluate the efficacy of a particular therapeutic treatment regimen in animal studies, in clinical trials, or to monitor the treatment of an individual patient.

[0063] In order to provide a basis for the diagnosis or prognosis of a disorder associated with a mutation in a particular tumour suppressor gene of the invention, the nucleotide sequence of the relevant gene can be compared between normal tissue and diseased tissue in order to establish whether the patient expresses a mutant gene.

[0064] In order to provide a basis for the diagnosis or prognosis of a disorder associated with decreased expression of a particular tumour suppressor gene of the invention, a normal or standard profile for expression is established. This may be accomplished by combining body fluids or cell extracts taken from normal subjects, either animal or human, with a sequence, or a fragment thereof, encoding the relevant breast cancer gene, under conditions suitable for hybridization or amplification. Standard hybridization may be quantified by comparing the values obtained from normal subjects with values from an experiment in which a known amount of a substantially purified polynucleotide is used. Another method to identify a normal or standard profile for expression of a particular tumour suppressor gene is through quantitative RT-PCR studies. RNA isolated from body cells of a normal individual, particularly RNA isolated from tumour cells, is reverse transcribed and real-time PCR using oligonucleotides specific for the relevant tumour suppressor gene are conducted to establish a normal level of expression of the gene.

[0065] Standard values obtained in both these examples may be compared with values obtained from samples from patients who are symptomatic for a disorder. Deviation from standard values is used to establish the presence of a disorder.

[0066] Once the presence of a disorder is established and a treatment protocol is initiated, hybridization assays or quantitative RT-PCR studies may be repeated on a regular basis to determine if the level of expression in the patient begins to approximate that which is observed in the normal subject. The results obtained from successive assays may be used to show the efficacy of treatment over a period ranging from several days to months.

[0067] In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide sequences, including genomic sequences, encoding a particular tumour suppressor gene, or closely related molecule, may be used to identify nucleic acid sequences which encode the gene. The specificity of the probe, whether it is made from a highly specific region, e.g., the 5′ regulatory region, or from a less specific region, e.g., a conserved motif, and the stringency of the hybridization or amplification will determine whether the probe identifies only naturally occurring sequences encoding the tumour suppressor gene, allelic variants, or related sequences.

[0068] Probes may also be used for the detection of related sequences, and should preferably have at least 50% sequence identity to any of the tumour suppressor encoding sequences. The hybridization probes of the subject invention may be DNA or RNA and may be derived from the sequence of SEQ ID Numbers: 1-26 or from genomic sequences including promoters, enhancers, and introns of the genes.

[0069] Means for producing specific hybridization probes for DNAs encoding the tumour suppressor genes of the invention include the cloning of polynucleotide sequences encoding these genes or their derivatives into vectors for the production of mRNA probes. Such vectors are known in the art, and are commercially available. Hybridization probes may be labelled by radionuclides such as ³²P or ³⁵S, or by enzymatic labels, such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, or other methods known in the art.

[0070] According to a further aspect of the invention there is provided the use of a polypeptide as described above in the diagnosis or prognosis of a cancer associated with a tumour suppressor gene of the invention, or a predisposition to such cancers.

[0071] When a diagnostic or prognostic assay is to be based upon a tumour suppressor protein, a variety of approaches are possible. For example, diagnosis or prognosis can be achieved by monitoring differences in the electrophoretic mobility of normal and mutant proteins. Such an approach will be particularly useful in identifying mutants in which charge substitutions are present, or in which insertions, deletions or substitutions have resulted in a significant change in the electrophoretic migration of the resultant protein. Alternatively, diagnosis may be based upon differences in the proteolytic cleavage patterns of normal and mutant proteins, differences in molar ratios of the various amino acid residues, or by functional assays demonstrating altered function of the gene products.

[0072] In another aspect, antibodies that specifically bind a tumour suppressor protein of the invention may be used for the diagnosis or prognosis of cancers characterized by reduced activity and/or expression of the gene, or in assays to monitor patients being treated with the gene. Such antibodies may include, but are not limited to, polyclonal, monoclonal, chimeric and single chain antibodies as would be understood by the person skilled in the art.

[0073] For the production of antibodies, various hosts including rabbits, rats, goats, mice, humans, and others may be immunized by injection with a protein of the invention or with any fragment or oligopeptide thereof, which has immunogenic properties. Various adjuvants may be used to increase immunological response and include, but are not limited to, Freund's, mineral gels such as aluminum hydroxide, and surface-active substances such as lysolecithin. Adjuvants used in humans include BCG (bacilli Calmette-Guerin) and Corynebacterium parvum.

[0074] It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to the tumour suppressor proteins of the invention have an amino acid sequence consisting of at least about 5 amino acids, and, more preferably, of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or fragments are identical to a portion of the amino acid sequence of the natural protein and contain the entire amino acid sequence of a small, naturally occurring molecule. Short stretches of amino acids from these proteins may be fused with those of another protein, such as KLH, and antibodies to the chimeric molecule may be produced.

[0075] Monoclonal antibodies to tumour suppressor proteins of the invention may be prepared using any technique which provides for the production of antibody molecules by continuous cell lines in culture. These include, but are not limited to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV-hybridoma technique. (For example, see Kohler et al., 1975; Kozbor et al., 1985; Cote et al., 1983; Cole et al., 1984).

[0076] Antibodies may also be produced by inducing in vivo production in the lymphocyte population or by screening immunoglobulin libraries or panels of highly specific binding reagents as disclosed in the literature. (For example, see Orlandi et al., 1989; Winter et al., 1991).

[0077] Antibody fragments which contain specific binding sites for the tumour suppressor proteins may also be generated. For example, such fragments include, F(ab′)₂ fragments produced by pepsin digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of the F(ab′)₂ fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and easy identification of monoclonal Fab fragments with the desired specificity. (For example, see Huse et al., 1989).

[0078] Various immunoassays may be used for screening to identify antibodies having the desired specificity. Numerous protocols for competitive binding or immunoradiometric assays using either polyclonal or monoclonal antibodies with established specificities are well known in the art. Such immunoassays typically involve the measurement of complex formation between a protein and its specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive to two non-interfering epitopes is preferred, but a competitive binding assay may also be employed.

[0079] Diagnostic or prognostic assays based on antibodies generated to the tumour suppressor proteins of the invention include methods that utilize the antibody and a label which will detect binding of the antibody to the appropriate protein in human body fluids or in extracts of cells or tissues. The antibodies may be used with or without modification, and may be labelled by covalent or non-covalent attachment of a reporter molecule.

[0080] A variety of protocols for measuring antibody binding include ELISAS, RIAS, and FACS, and are known in the art. These methods provide a basis for diagnosing altered or abnormal levels of tumour suppressor gene expression. Normal or standard values for tumour suppressor gene expression are established by combining body fluids or cell extracts taken from normal mammalian subjects, preferably human, with antibody to the appropriate tumour suppressor protein under conditions suitable for complex formation. The amount of standard complex formation may be quantitated by various methods, preferably by photometric means. Quantities of any of the tumour suppressor genes expressed in subject, control, and disease samples from biopsied tissues are compared with the standard values. Deviation between standard and subject values establishes the parameters for diagnosing disease.

[0081] Once an individual has been diagnosed with a cancer, effective treatments can be initiated. These may include administering a selective agonist to the relevant mutant tumour suppressor gene so as to restore its function to a normal level or introduction of the wild-type gene, particularly through gene therapy approaches as described above. Typically, a vector capable of expressing the appropriate full-length tumour suppressor gene or a fragment or derivative thereof may be administered. In an alternative approach to therapy, a substantially purified breast cancer polypeptide and a pharmaceutically acceptable carrier may be administered, as described above.

[0082] The nucleotide and protein sequences of the invention provide the ability to identify proteins that interact with these sequences. Interacting proteins may give an insight into the biological pathways in which the tumour suppressor proteins participate. In turn, proteins within these pathways may provide suitable targets for therapeutic applications.

[0083] The activity and/or expression of proteins within these pathways can subsequently be modulated in a number of ways so as to ultimately mimic the action that the wild-type tumour suppressor gene normally plays within the pathway. Methods include administering an antagonist of the proteins within the pathway to a subject in need of such treatment, such as an antibody designed to the protein or small molecule interactor, or through the use of antisense technologies such as antisense oligonucleotides, ribozymes, DNAzymes, injection of antisense RNA and transfection of antisense RNA expression vectors.

[0084] Drug Screening

[0085] The tumour suppressor genes of the invention may be used to construct eukaryotic cell lines which carry mutations in a particular gene of interest. The host cell lines are also defective at the polypeptide level. Other cell lines may be used where the expression of the tumour suppressor gene of interest can be switched off. The host cell lines or cells are grown in the presence of various drug compounds and the rate of growth of the host cells is measured to determine if the compound is capable of regulating the growth of defective cells.

[0086] Candidate pharmaceutical agents or compounds encompass numerous chemical classes, though typically they are organic molecules, preferably small organic compounds having molecular weight of more than 100 and less than about 2,500 daltons. Candidate agents are also found among biomolecules including peptides, saccharides, fatty acids and steroids. In this case peptides are preferred.

[0087] Compounds developed as a result of combinatorial library technology may also be screened. This provides a way to test a large number of different substances for their ability to restore wild-type function to the cells which contain a dysfunctional tumour suppressor gene. The use of peptide libraries is preferred (see patent WO97/02048) with such libraries and their use known in the art.

[0088] A substance identified as a modulator of cell growth and function may be peptide or non-peptide in nature. Non-peptide “small molecules” are often preferred for many in vivo pharmaceutical applications. In addition, a mimic or mimetic of the substance may be designed for pharmaceutical use. The design of mimetics based on a known pharmaceutically active compound (“lead” compound) is a common approach to the development of novel pharmaceuticals. This is often desirable where the original active compound is difficult or expensive to synthesise or where it provides an unsuitable method of administration. In the design of a mimetic, particular parts of the original active compound that are important in determining the target property are identified. These parts or residues constituting the active region of the compound are known as its pharmacophore. Once found, the pharmacophore structure is modelled according to its physical properties using data from a range of sources including x-ray diffraction data and NMR. A template molecule is then selected onto which chemical groups which mimic the pharmacophore can be added. The selection can be made such that the mimetic is easy to synthesise, is likely to be pharmacologically acceptable, does not degrade in vivo and retains the biological activity of the lead compound. Further optimisation or modification can be carried out to select one or more final mimetics useful for in vivo or clinical testing.

[0089] It is also possible to isolate a target-specific antibody and then solve its crystal structure. In principle, this approach yields a pharmacophore upon which subsequent drug design can be based as described above. It may be possible to avoid protein crystallography altogether by generating anti-idiotypic antibodies (anti-ids) to a functional, pharmacologically active antibody. As a mirror image of a mirror image, the binding site of the anti-ids would be expected to be an analogue of the original binding site. The anti-id could then be used to isolate peptides from chemically or biologically produced peptide banks.

[0090] Proteins involved in the tumour suppressor gene pathway may also be used for the screening of candidate pharmaceutical agents or compounds that interact with the proteins, for the treatment of cancers associated with the tumour suppressor dysfunction.

[0091] Agent screening techniques may include utilising eukaryotic or prokaryotic host cells that are stably transformed with recombinant molecules expressing a particular tumour suppressor pathway polypeptide preferably in competitive binding assays. Binding assays will measure for the formation of complexes between the polypeptide of interest and the agent being tested, or will measure the degree to which an agent being tested will interfere with the formation of a complex between the polypeptide of interest and a known ligand.

[0092] Another technique for drug screening provides high-throughput screening for compounds having suitable binding affinity to a tumour suppressor pathway polypeptide !of interest (see PCT published application WO84/03564). In this stated technique, large numbers of small peptide test compounds can be synthesised on a solid substrate and can be assayed through polypeptide binding and washing. Bound polypeptide is then detected by methods well known in the art. In a variation of this technique, purified polypeptides can be coated directly onto plates to identify interacting test compounds.

[0093] A tumour suppressor pathway polypeptide of interest may also be used for screening compounds developed as a result of combinatorial library technology as described above.

[0094] In a further aspect a pharmaceutical composition and a pharmaceutically acceptable carrier may be administered. The pharmaceutical composition may comprise any one or more of a polypeptide as described above, typically a substantially purified tumour suppressor polypeptide, an antibody to a tumour suppressor polypeptide, a vector capable of expressing a tumour suppressor polypeptide, a compound which increases expression of a tumour suppressor gene or a candidate drug that restores wild-type activity to a tumour suppressor gene.

[0095] The pharmaceutical composition may be administered to a subject to treat or prevent a cancer associated with decreased activity and/or expression of a tumour suppressor gene including, but not limited to, those provided above. Pharmaceutical compositions in accordance with the present invention are prepared by mixing a polypeptide of the invention, or active fragments or variants thereof, having the desired degree of purity, with acceptable carriers, excipients, or stabilizers, which are well known. Acceptable carriers, excipients or stabilizers are nontoxic at the dosages and concentrations employed, and include buffers such as phosphate, citrate, and other organic acids; antioxidants including absorbic acid; low molecular weight (less than about 10 residues) polypeptides; proteins, such as serum albumin, gelatin, or immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino acids such as glycine, glutamine, asparagine, arginine or lysine; monosaccharides, disaccharides, and other carbohydrates including glucose, mannose, or dextrins; chelating agents such as EDTA; sugar alcohols such as mannitrol or sorbitol; salt-forming counterions such as sodium; and/or nonionic surfactants such as Tween, Pluronics or polyethylene glycol (PEG).

[0096] Any of the therapeutic methods described above may be applied to any subject in need of such therapy, including, for example, mammals such as dogs, cats, cows, horses, rabbits, monkeys, and most preferably, humans.

[0097] Microarray

[0098] In further embodiments, complete cDNAs, oligonucleotides or longer fragments derived from any of the polynucleotide sequences described herein may be used as targets in a microarray. The microarray can be used to monitor the expression level of large numbers of genes simultaneously and to identify genetic variants, mutations, and polymorphisms. This information may be used to determine gene function, to understand the genetic basis of a disorder, to diagnose or prognose a disorder, and to develop and monitor the activities of therapeutic agents. Microarrays may be prepared, used, and analyzed using methods known in the art. (For example, see Schena et al., 1996; Heller et al., 1997).

[0099] Transformed Hosts

[0100] The present invention also provides for the production of genetically modified (knock-out, knock-in and transgenic), non-human animal models transformed with the DNA molecules of the invention. These animals are useful for the study of tumour suppressor gene function, to study the mechanisms of cancer as related to the tumour suppressor genes, for the screening of candidate pharmaceutical compounds, for the creation of explanted mammalian cell cultures which express the protein or mutant protein and for the evaluation of potential therapeutic interventions.

[0101] One of the tumour suppressor genes of the invention may have been inactivated by knock-out deletion, and knock-out genetically modified non-human animals are therefore provided.

[0102] Animal species which are suitable for use in the animal models of the present invention include, but are not limited to, rats, mice, hamsters, guinea pigs, rabbits, dogs, cats, goats, sheep, pigs, and non-human primates such as monkeys and chimpanzees. For initial studies, genetically modified mice and rats are highly desirable due to their relative ease of maintenance and shorter life spans. For certain studies, transgenic yeast or invertebrates may be suitable and preferred because they allow for rapid screening and provide for much easier handling. For longer-term studies, non-human primates may be desired due to their similarity with humans.

[0103] To create an animal model for a mutated tumour suppressor gene several methods can be employed. These include generation of a specific mutation in a homologous animal gene, insertion of a wild type human gene and/or a humanized animal gene by homologous recombination, insertion of a mutant (single or multiple) human gene as genomic or minigene cDNA constructs using wild type or mutant or artificial promoter elements or insertion of artificially modified fragments of the endogenous gene by homologous recombination. The modifications include insertion of mutant stop codons, the deletion of DNA sequences, or the inclusion of recombination elements (lox p sites) recognized by enzymes such as Cre recombinase.

[0104] To create a transgenic mouse, which is preferred, a mutant version of a particular breast cancer gene of the invention can be inserted into a mouse germ line using standard techniques of oocyte microinjection or transfection or microinjection into embryonic stem cells. Alternatively, if it is desired to inactivate or replace the endogenous breast cancer gene, homologous recombination using embryonic stem cells may be applied.

[0105] For oocyte injection, one or more copies of the mutant or wild type tumour suppressor gene can be inserted into the pronucleus of a just-fertilized mouse oocyte. This oocyte is then reimplanted into a pseudo-pregnant foster mother. The liveborn mice can then be screened for integrants using analysis of tail DNA for the presence of human tumour suppressor gene sequences. The transgene can be either a complete genomic sequence injected as a YAC, BAC, PAC or other chromosome DNA fragment, a cDNA with either the natural promoter or a heterologous promoter, or a minigene containing all of the coding region and other elements found to be necessary for optimum expression.

[0106] According to still another aspect of the invention there is provided the use of genetically modified non-human animals as described above for the screening of candidate pharmaceutical compounds.

[0107] It will be clearly understood that, although a number of prior art publications are referred to herein, this reference does not constitute an admission that any of these documents forms part of the common general knowledge in the art, in Australia or in any other country. Throughout this specification and the claims, the words “comprise”, “comprises” and “comprising” are used in a non-exclusive sense, except where the context requires otherwise.

BRIEF DESCRIPTION OF THE DRAWINGS

[0108]FIG. 1. Schematic representation of tumours with interstitial and terminal allelic loss on chromosome arm 16q in the two series of tumour samples. Polymorphic markers are listed according to their order on 16q from centromere to telomere and the markers used for each series are indicated by X. Tumour identification numbers are shown at the top of each column. At the right of the figure, the three smallest regions of loss of heterozygosity are indicated.

MODES FOR PERFORMING THE INVENTION EXAMPLE 1 Collection of Breast Cancer Patient Material

[0109] Two series of breast cancer patients were analysed for this study. Histopathological classification of each tumour specimen was carried out by our collaborators according to World Health Organisation criteria (WHO, 1981). Patients were graded histopathologically according to the modified Bloom and Richardson method (Elston and Ellis, 1990) and patient material was obtained upon approval of local Medical Ethics Committees. Tumour tissue DNA and peripheral blood DNA from the same individual was isolated as previously described (Devilee et al., 1991) using standard laboratory protocols.

[0110] Series 1 consisted of 189 patients operated on between 1986 and 1993 in three Dutch hospitals, a Dutch University and two peripheral centres. Tumour tissue was snap frozen within a few hours of resection. For DNA isolation, a tissue block was selected only if it contained at least 50% of tumour cells following examination of haematoxilin and eosin stained tissue sections by a pathologist. Tissue blocks that contained fewer than 50% of tumour cells were omitted from further analysis.

[0111] Series 2 consisted of 123 patients operated on between 1987 and 1997 at the Flinders Medical Centre in Adelaide, Australia. Of these, 87 were collected as fresh specimens within a few hours of surgical resection, confirmed as malignant tissue by pathological analysis, snap frozen in liquid nitrogen, and stored at −70° C. The remaining 36 tumour tissue samples were obtained from archival paraffin embedded tumour blocks. Prior to DNA isolation, tumour cells were microdissected from tissue sections mounted on glass slides so as to yield at least 80% tumour cells. In some instances, no peripheral blood was available such that pathologically identified paraffin embedded non-malignant lymph node tissue was used instead.

EXAMPLE 2 LOH Analysis of Chromosome 16q Markers in Breast cancer samples

[0112] In order to identify the location of tumour suppressor genes associated with breast cancer, LOH analysis of tumour samples was conducted. A total of 45 genetic markers mapping to chromosome 16 were used for the LOH analysis of the breast tumour and matched normal DNA samples collected for this study. FIG. 1 indicates for which tumour series they were used and their cytogenetic location. Details regarding all markers can be obtained from the Genome Database (GDB) at http://www.gdb.org. The physical order of markers with respect to each other was determined from a combination of information in GDB, by mapping on a chromosome 16 somatic cell hybrid map (Callen et al., 1995) and by genomic sequence information.

[0113] Four alternative methods were used for the LOH analysis:

[0114] 1) For RFLP and VNTR markers, Southern blotting was used to test for allelic imbalance. These markers were used on only a subset of samples. Methods used were as previously described (Devilee et al., 1991).

[0115] 2) Microsatellite markers were amplified from tumour and normal DNA using the polymerase chain reaction (PCR) incorporating standard methodologies (Weber and May, 1989; Sambrook et al., 1989). A typical reaction consisted of 12 ul and contained 100 ng of template, 5 pmol of both primers, 0.2 mM of each dNTP, 1 μcurie [α-³²P]dCTP, 1.5 mM MgCl₂, 1.2 ul Supertag buffer and 0.06 units of Supertag (HT biotechnologies). A Phosphor Imager type 445 SI (Molecular Dynamics, Sunnyvale, Calif.) was used to quantify ambiguous results. In these cases, the Allelic Imbalance Factor (AIF) was determined as the quotient of the peak height ratios from the normal and tumour DNA pair. The threshold for allelic imbalance was defined as a 40% reduction of one allele, agreeing with an AIF of ≧1.7 or ≦0.59. This threshold is in accordance with the selection of tumour tissue blocks containing at least 50% tumour cells with a 10% error-range. The threshold for retention has been previously determined to range from 0.76 to 1.3 (Devilee et al., 1994). This leaves a range of AIFs (0.58-0.75 and 1.31-1.69) for which no definite decision has been made. This “grey area” is indicated by grey boxes in FIG. 1 and tumours with only “grey area” values were discarded completely from the analysis.

[0116] 3) The third method for determining allelic imbalance was similar to the second method above, however radioactively labelled dCTP was omitted. Instead, PCR of polymorphic microsatellite markers was done with one of the PCR primers labelled fluorescently with FAM, TET or HEX. Analysis of PCR products generated was on an ABI 377 automatic sequencer (PE Biosystems) using 6% polyacrylamide gels containing 8M urea. Peak height values and peak sizes were analysed with the GeneScan programme (PE Biosystems). The same thresholds for allelic imbalance, retention and grey areas were used as for the radioactive analysis.

[0117] 4) An alternative fluorescent-based system was also used. In this instance PCR primers were labelled with fluorescein or hexachlorofluorescein. PCR reaction volumes were 20 μl and included 100 ng of template, 100 ng of each primer, 0.2 mM of each dNTP, 1-2 mM MgCl₂, 1× AmpliTaq Gold buffer and 0.8 units AmpliTaq Gold enzyme (Perkin Elmer). Cycling conditions were 10 cycles of 94° C. for 30 seconds, 60° C. for 30 seconds, 72° C. for 1 minute, followed by 25 cycles of 94° C. 30 seconds, 55° C. for 30 seconds, 72° C. for 1 minute, with a final extension of 72° C. for 10 minutes. PCR amplimers were analysed on an ABI 373 automated sequencer (PE Biosystems) using the GeneScan programme (PE Biosystems). The threshold range of AIF for allele retention was defined as 0.61-1.69, allelic loss as ≦0.5 or ≧2.0, or the “grey area” as 051-0.6 or 1.7-1.99.

[0118] The first three methods were applied to the first tumour series while the last method was adopted for the second series of tumour samples. For statistical analysis, a comparison of allelic imbalance data for validation of the different detection methods and of the different tumour series was done using the Chi-square test.

[0119] The identification of the smallest region of overlap (SRO) involved in LOH is instrumental for narrowing down the location of the tumour suppressor gene targeted by LOH. FIG. 1 shows the LOH results for tumour samples, which displayed small regions of loss (ie interstitial and telomeric LOH) and does not include samples that showed complex LOH (alternating loss and retention of markers). When comparing the two sample sets at least three consistent regions emerge with two being at the telomere in band 16q24.3 and one at 16q22.1. The region at 16q22.1 is defined by the markers D16S398 and D16S301 and is based on the interstitial LOH events seen in three tumours from series 1 (239/335/478) and one tumour from series 2 (237). At the telomere (16q24.2-16q24.3), the first region is defined by the markers D16S498 and D16S3407 and is based on four tumours from series 2 (443/75/631/408) while the second region (16q24.3) extends from D16S3407 to the telomere and is based on one tumour from series 1 (559) and three from series 2 (97/240/466). LOH limited to the telomere but involving both of the regions identified at this site could be found in an additional 17 tumour samples.

[0120] Other studies have shown that the long arm of chromosome 16 is also a target for LOH in prostate, lung, hepatocellular, ovarian, rhabdomyosarcoma and Wilms' tumours. Detailed analysis of prostate carcinomas has revealed an overlap in the smallest regions of LOH seen in this cancer to that seen with breast cancer which suggests that 16q harbours a tumour suppressor gene implicated in many tumour types.

EXAMPLE 3 Construction of a Physical Map of 16q24.3

[0121] To identify novel candidate breast cancer tumour suppressor genes mapping to the smallest regions of overlap at 16q24.3, a clone based physical map contig covering this region was needed. At the start of this phase of the project the most commonly used and readily accessible cloned genomic DNA fragments were contained in lambda, cosmid or YAC vectors. During the construction of whole chromosome 16 physical maps, clones from a number of YAC libraries were incorporated into the map (Doggett et al., 1995). These included clones from a flow-sorted chromosome 16-specific YAC library (McCormick et al., 1993), from the CEPH Mark I and MegaYAC libraries and from a half-telomere YAC library (Riethman et al., 1989). Detailed STS and Southern analysis of YAC clones mapping at 16q24.3 established that very few were localised between the CY2/CY3 somatic cell hybrid breakpoint and the long arm telomere. However, those that were located in this region gave inconsistent mapping results and were suspected to be rearranged or deleted. Coupled with the fact that YAC clones make poor sequencing substrates, and the difficulty in isolating the cloned human DNA, a physical map based on cosmid clones was the initial preferred option.

[0122] A flow-sorted chromosome 16 specific cosmid library had previously been constructed (Longmire et al., 1993), with individual cosmid clones gridded in high-density arrays onto nylon membranes. These filters collectively contained ˜15,000 clones representing an approximately 5.5 fold coverage of chromosome 16. Individual cosmids mapping to the critical regions at 16q24.3 were identified by the hybridisation of these membranes with markers identified by this and previous studies to map to the region. The strategy to align overlapping cosmid clones was based on their STS content and restriction endonuclease digestion pattern. Those clones extending furthest within each initial contig were then used to walk along the chromosome by the hybridisation of the ends of these cosmids back to the high-density cosmid grids. This process continued until all initial contigs were linked and therefore the region defining the location of the breast cancer tumour suppressor genes would be contained within the map. Individual cosmid clones representing a minimum tiling path in the contig were then used for the identification of transcribed sequences by exon trapping, and for genomic sequencing.

[0123] Chromosome 16 was sorted from the mouse/human somatic cell hybrid CY18, which contains this chromosome as the only human DNA, and Sau3A partially digested CY18 DNA was ligated into the BamHI cloning site of the cosmid sCOS-1 vector. All grids were hybridised and washed using methods described in Longmire et al. (1993). Briefly, the 10 filters were pre-hybridised in 2 large bottles for at least 2 hours in 20 ml of a solution containing 6×SSC; 10 mM EDTA (pH 8.0); 10× Denhardt's; 1% SDS and 100 μg/ml denatured fragmented salmon sperm DNA at 65° C. Overnight hybridisations with [α-³²P]dCTP labelled probes were performed in 20 ml of fresh hybridisation solution at 65° C. Filters were washed sequentially in solutions of 2×SSC; 0.1% SDS (rinse at room temperature), 2×SSC; 0.1% SDS (room temperature for 15 minutes), 0.1×SSC; 0.1% SDS (room temperature for 15 minutes), and 0.1×SSC; 0.1% SDS (twice for 30 minutes at 50° C. if needed). Membranes were exposed at −70° C. for between 1 to 7 days.

[0124] Initial markers used for cosmid grid screening were those known to be located below the somatic cell hybrid breakpoints CY2/CY3 and the long arm telomere (Callen et al., 1995). These included three genes, CMAR, DPEP1, and MC1R; the microsatellite marker D16S303; an end fragment from the cosmid 317E5, which contains the BBC1 gene; and four cDNA clones, yc81e09, yh09a04, D16S532E, and ScDNA-C113. The IMAGE consortium cDNA clone, yc81e09, was obtained through screening an arrayed normalised infant brain oligo-dT primed cDNA library (Soares et al., 1994), with the insert from cDNA clone ScDNA-ASS. Both the ScDNA-A55 and ScDNA-C113 clones were originally isolated from a hexamer primed heteronuclear cDNA library constructed from the mouse/human somatic cell hybrid CY18 (Whitmore et al., 1994). The IMAGE cDNA clone yh09a04 was identified from direct cDNA selection of the cosmid 37B2 which was previously shown to map between the CY18A(D2) breakpoint and the 16q telomere. The EST, D16S532E, was also mapped to the same region. Subsequent to these initial screenings, restriction fragments representing the ends of cosmids were used to identify additional overlapping clones.

[0125] Contig assembly was based on methods previously described (Whitmore et al., 1998). Later during the physical map construction, genomic libraries cloned into BAC or PAC vectors (Genome Systems or Rosewell Park Cancer Institute) became available. These libraries were screened to aid in chromosome walking or when gaps that could not be bridged by using the cosmid filters were encountered. All BAC and PAC filters were hybridised and washed according to manufacturers recommendations. Initially, membranes were individually pre-hybridised in large glass bottles for at least 2 hours in 20 ml of 6×SSC; 0.5% SDS; 5× Denhardt's; 100 μg/ml denatured salmon sperm DNA at 65° C. Overnight hybridisations with [α-³²P]dCTP labelled probes were performed at 65° C. in 20 ml of a solution containing 6×SSC; 0.5% SDS; 100 μg/ml denatured salmon sperm DNA. Filters were washed sequentially in solutions of 2×SSC; 0.5% SDS (room temperature 5 minutes), 2×SSC; 0.1% SDS (room temperature 15 minutes) and 0.1×SSC; 0.5% SDS (37° C. 1 hour if needed). PAC or BAC clones identified were aligned to the existing contig based on their restriction enzyme pattern or formed unique contigs which were extended by additional filter screens.

[0126] A high-density physical map consisting of cosmid, BAC and PAC clones was established, which extended approximately 3 Mb from the telomere of the long arm of chromosome 16. This contig extends beyond the CY2/CY3 somatic cell hybrid breakpoint and includes the 2 regions of minimal LOH identified at the 16q24.3 region in breast cancer samples. To date, a single gap of unknown size exists in the contig and will be closed by additional contig extension experiments. The depth of coverage has allowed the identification of a minimal tiling path of clones which were subsequently used as templates for gene identification methods such as exon trapping and genomic DNA sequencing.

EXAMPLE 4 Identification of Candidate Tumour Suppressor Genes by Analysis of Genomic DNA Sequence

[0127] Selected minimal overlapping BAC and PAC clones from the physical map contig were sequenced in order to aid in the identification of candidate breast cancer genes. DNA was prepared from selected clones using a large-scale DNA isolation kit (Qiagen). Approximately 25-50 ug of DNA was then sheared by nebulisation (10 psi for 45 seconds) and blunt ended using standard methodologies (Sambrook et al., 1989). Samples were then run on an agarose gel in order to isolate DNA in the 2-4 Rb size range. These fragments were cleaned from the agarose using QIAquick columns (Qiagen), ligated into puc18 and used to transform competent DH10B or DH5a E. coli cells. DNA was isolated from transformed clones and was sequenced using vector specific primers on an ABI377 sequencer.

[0128] Analysis of genomic sequence was performed using PHRED, PHRAP and GAP4 software on a SUN workstation. To assist in the generation of large contigs of genomic sequence, information present in the high-throughput genomic sequence (htgs) database at NCBI was incorporated into the assembly phase of the sequence analysis. The resultant genomic sequence contigs were masked for repeats and analysed using the BLAST algorithm (Altschul et al., 1997) to identify nucleotide and protein homology to sequences in the GenBank non-redundant and EST databases at NCBI. The genomic sequence was also analysed for predicted gene structure using the GENSCAN program and specific screening of the mouse EST dataset was utilised to identify potential human orthologues that have poor representation in the human EST dataset.

[0129] Following the identification of homologous EST sequences, in silico cDNA walking experiments were initiated through further dbEST database screening. This was to identify overlapping cDNA sequences present in dbEST that would allow extension of the originally identified partial gene sequence. Overlapping EST sequences were assembled using the DNAStar LaserGene sequence assembly software. Homologous IMAGE cDNA clones in some instances were also purchased and sequenced. These longer stretches of sequence were then compared to known genes by nucleotide and amino acid sequence comparisons using the above procedures.

[0130] From in silico analysis of the dbEST database at NCBI using all genomic sequence obtained for the 16q24.3 critical LOH region, a total of 55 gene fragments or gene “signatures” were identified. In the majority of cases each novel gene fragment was represented by a distinct UniGene cluster composed of one or a number of overlapping cDNA clones. The majority of these UniGene clusters appeared to represent the 3′ untranslated regions of their representative gene as their sequence was continuous with the genomic sequence and further in silico manipulation failed to identify open reading frames representing amino acid coding regions.

[0131] As well as the 55 gene signatures that were identified in the 16q24.3 region analysed, a total of 48 partial or full-length genes were also present based on in silico analysis of the genomic DNA generated.

[0132] Those sequences that are expressed in the breast were considered to be the most likely candidate breast cancer genes. Those genes whose function could implicate it in the tumourigenic process, as predicted from homology searches with known proteins, were treated with the highest priority. Further evidence that a particular candidate is the responsible gene comes from the identification of defective alleles of the gene in affected individuals or from analysis of the expression levels of a particular candidate gene in breast cancer samples compared with normal control tissues.

[0133] Table 1 lists those genes or partial gene fragments that were identified to encode functional domains indicative of a potential role in the tumourigenic process. These genes were treated as tumour suppressor genes mapping to the 16q24.3 LOH interval. These genes are represented by SEQ ID Numbers: 1 to 26 and are described in detail below.

[0134] BNO60 represents the PRDM7 gene which encodes a protein containing a SET domain and four C2H2 type zinc finger domains. This domain composition is characteristic to the PRDM protein family, which includes 17 proteins. This family is thought to play an important role in chromatin-mediated gene regulation, in development, and in cancer. The majority of studies have focused on the function of PRDM2. This gene is frequently deleted in human cancers, and its re-introduction in tumours causes growth suppression and apoptosis in animal tumour models. The SET domain has recently been shown to possess methyltransferase activity. This finding suggests that the PRDM proteins may influence gene expression by methylation of chromatin. On the basis of the domain composition of PRDM7, its similarity to the demonstrated tumour suppressor protein PRDM2, and its possible role in gene expression regulation as a chromatin methyltransferase, PRDM7 is an ideal tumour suppressor gene candidate.

[0135] A group of six genes were identified to encode proteins containing zinc finger domains. These included BNO224, BNO36, BNO44, BNO34, BNO208 and BNO230. The zinc finger domain composition in the predicted proteins of these genes suggests a possible role in DNA binding and/or protein complex interactions that may influence gene transcription events. Any protein involved in gene transcription regulation is of central interest in the elucidation of gene expression control pathways in breast cancer.

[0136] From database analysis, the EST clones representing BNO224 were wrongly incorporated in the EST cluster representing the Fanconi Anemia A gene (FANCA). Our detailed in silico analysis of the genomic sequence in this region has indicated that the BNO224 gene is distinct and its direction of transcription is opposite to that of FANCA. The two genes converge with their last exons overlapping. The BNO224/FANCA genomic locus is conserved in the mouse with these two genes converging and overlapping (Wong et al, 2000). Analyses of the predicted protein product indicates that this gene codes for a protein that includes 5 zinc finger domains two of which appear to be forming a PHD finger domain. The PHD finger is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in chromatin-mediated transcriptional regulation. The PHD finger motif is reminiscent of, but distinct from, the C3HC4 type RING finger. The function of this domain is not yet known but in analogy with the LIM domain it could be involved in protein-protein interactions and be important for the assembly or activity of multicomponent complexes involved in transcriptional activation or repression. In similarity to the RING finger and the LIM domain, the PHD finger is thought to bind two zinc ions.

[0137] BNO36 was identified through extensive in silico analysis coupled with GENSCAN gene prediction methods. The human genomic sequence at 16q24.3 identified a complete mouse gene sequence (Mm.103673) but failed to identify corresponding human EST sequences. The GENSCAN program was used on the human genomic sequence in this region and predicted exons were identified that overlapped with regions exhibiting homology to the mouse sequence. BNO36 therefore represents the human orthologue of this mouse gene. BNO36 codes for a 320 amino acid protein that exhibits homology to the SNAI1 gene on chromosome 20. SNAI1 has been found to exhibit reduced expression in breast cancer tissue suggesting a protective role for this protein in normal breast tissue.

[0138] BNO44 consists of 2 isoforms and was isolated using a combination of cDNA walking, sequencing of IMAGE cDNA clones, homology searches with mouse EST sequences and exon trapping (see Whitmore et al., 1998 for the exon trapping procedure used). The short form of the gene is 794 base pairs in length (SEQ ID NO: 23), consists of 6 exons and codes for a protein of 197 amino acids (SEQ ID NO: 24). The longer version of the gene is 3,525 base pairs in length (SEQ ID NO: 25), consists of 13 exons and codes for a protein of 451 amino acids (SEQ ID NO: 26). This version of the gene is the result of the use of additional 3′ exons not used in the short form of BNO44. In silico analysis of the long form of BNO44 identifies two functional motifs. These include a DAG/PE binding domain incorporating amino acids 138-186 and PHD/LAP motif incorporating amino acids 384-431. The DAG/PE binding domain has been found in a family of serine/threonine protein kinases which include oncoproteins. The PHD/LAP motif is a zinc finger motif found in a group of proteins known to have function in chromatin remodeling, suggesting its role in transcription regulation. The short form of the BNO44 protein does not show sequence homology to any functional motifs present in available databases.

[0139] BNO55 corresponds to the CDT1 gene which is required at the initiation step of transcription (replication licensing). This gene mediates the loading of pre-replication complexes on origin-of-replication sites and is destabilized from chromatin once pre-replication complexes have formed. Its potential involvement in the initiation of DNA replication may argue that this is not a good tumour suppressor candidate. However, after initiation licensing this protein dissociates from the replication origin preventing any further initiation events. Mutations in this gene may render the protein incapable of dissociating leading to continuous replication initiation.

[0140] BNO229 corresponds to the CHMP1 gene which is translated from an alternative reading frame located in the transcript coding for PRSM1. CHMP1 codes for a component of the Polycomb-group (PcG) that is involved in gene silencing through mechanisms involving modification of chromatin structure. Exogenous expression of CHMP1 results in cell arrest in the S-phase of the cell cycle suggesting a tumour suppressor role of the protein.

[0141] BNO228 corresponds to the BTG3 associated nuclear protein (BANP) which has been shown to interact with the BTG3 protein (Birot et al., 2000). Currently no function has been assigned to BANP, but the BTG family members are believed to have an antiproliferative function. For example, BTG3 RNA expression is associated with different cell cycle arrest processes and BTG3 in vitro overexpression is antiproliferative (Yoshida et al, 1998). The underlying mechanism by which BTG family members induce growth inhibition remain undefined. It has been recently hypothesised that BTG2 interacts with the classical tumour suppressor Rb, maintaining its hypophosphorylated state and preventing cell division. (Guardavaccaro et al, 2000).

EXAMPLE 6 Analysis of Tumours and Cell Lines for Tumour Suppressor Gene Mutations

[0142] Any one of the tumour suppressor genes that have been identified in this study can be screened by single strand conformation polymorphism (SSCP) analysis in DNA isolated from tumours which display restricted LOH for the 16q24.3 region. Mutations specifically identified in these genes in cancerous tissue will confirm an involvement of that gene in the cancer. In this instance DNA isolated from series 1 and series 2 tumours can be used. A number of breast cancer cell lines, or cell lines from other cancer types, may also be screened. Likewise, tissues from other cancer types can be screened by SSCP for disease causing mutations. Cell lines can be purchased from ATCC, grown according to manufacturers conditions, and DNA isolated from cultured cells using standard protocols (Wyman and White, 1980; Sambrook et al., 1989).

[0143] To perform mutation analysis of the tumour suppressor genes using the SSCP technique, a number of variations can be employed. For example, tumour suppressor gene exons can be amplified by PCR using flanking intronic primers, which are labeled at their 5′ ends with HEX. Typical PCR reactions are performed in 96-well plates in a volume of 10 ul using 30 ng of template DNA. Cycling conditions involve an initial denaturation step at 94° C. for 3 minutes followed by 35 cycles of 94° C. for 30 seconds, 60° C. for 11/2 minutes and 72° C. for 1^(1/2) minutes. A final extension step of 72° C. for 10 minutes follows. Twenty ul of loading dye comprising 50% (v/v) formamide, 12.5 mM EDTA and 0.02% (w/v) bromophenol blue is added to completed reactions which are subsequently run on 4% polyacrylamide gels and analysed on the GelScan 2000 system (Corbett Research, AUS) according to manufacturers specifications.

[0144] Those samples that display a bandshift compared with normal controls are considered to have a different nucleotide composition in the amplicon being analysed compared to that of normal controls. The amplicon can be sequenced in this sample and compared to wild-type sequence to determine the nucleotide differences. Any base changes that are present in a tumour sample but not present in the corresponding normal control sample from the same individual or in other normal individuals most likely represents a deleterious mutation. This is further confirmed if the base change also leads to an amino acid change or the generation of a truncated form of the protein.

EXAMPLE 7 Analysis of the Tumour Suppressor Genes

[0145] The following methods are used to determine the structure and function of any one of the breast cancer genes.

[0146] Biological Studies

[0147] Mammalian expression vectors containing tumour suppressor gene cDNA can be transfected into breast, prostate or other carcinoma cell lines that have lesions in the gene. Phenotypic reversion in cultures (eg cell morphology, growth of transformants in soft-agar, growth rate) and in animals (eg tumourigenicity in nude mice) is examined. These studies can utilise wild-type or mutant forms of the tumour suppressor genes. Deletion and missense mutants of these genes can be constructed by in vitro mutagenesis.

[0148] Molecular Biological Studies

[0149] The ability of any one of the tumour suppressor proteins to bind known and unknown proteins can be examined. These proteins may give an insight as to the biological pathways in which the tumour suppressor proteins participate. In turn, proteins within these pathways may provide suitable targets for therapeutic applications such as screening for small molecule interactors, as well as antisense and antibody-based therapies directed at these interactors.

[0150] The principle behind the yeast two-hybrid procedure is that many eukaryotic transcriptional activators, including those in yeast, consist of two discrete modular domains. The first is a DNA-binding domain that binds to a specific promoter sequence and the second is an activation domain that directs the RNA polymerase II complex to transcribe the gene downstream of the DNA binding site. Both domains are required for transcriptional activation as neither domain can activate transcription on its own. In the yeast two-hybrid procedure, the gene of interest or parts thereof (BAIT), is cloned in such a way that it is expressed as a fusion to a peptide that has a DNA binding domain. A second gene, or number of genes, such as those from a cDNA library (TARGET), is cloned so that it is expressed as a fusion to an activation domain. Interaction of the protein of interest with its binding partner brings the DNA-binding peptide together with the activation domain and initiates transcription of the reporter genes. The first reporter gene will select for yeast cells that contain interacting proteins (this reporter is usually a nutritional gene required for growth on selective media). The second reporter is used for confirmation and while being expressed in response to interacting proteins it is usually not required for growth.

[0151] Structural Studies

[0152] Tumour suppressor recombinant proteins can be produced in bacterial, yeast, insect and/or mammalian cells and used in crystallographical and NMR studies. Together with molecular modeling of the proteins, structure-driven drug design can be facilitated.

EXAMPLE 8 Generation of Polyclonal Antibodies Against the Tumour Suppressor Proteins

[0153] The knowledge of the nucleotide and amino acid sequence of the tumour suppressor genes and associated proteins allows for the production of antibodies, which selectively bind to these proteins or fragments thereof. Following the identification of mutations in these tumour suppressor genes, antibodies can also be made to selectively bind and distinguish mutant from normal protein. Antibodies specific for mutagenised epitopes are especially useful in cell culture assays to screen for malignant cells at different stages of malignant development. These antibodies may also be used to screen malignant cells, which have been treated with pharmaceutical agents to evaluate the therapeutic potential of the agent.

[0154] To prepare polyclonal antibodies, short peptides can be designed homologous to any one of the tumour suppressor amino acid sequences. Such peptides are typically 10 to 15 amino acids in length. These peptides should be designed in regions of least homology to the mouse orthologue to avoid cross species interactions in further down-stream experiments such as monoclonal antibody production. Synthetic peptides can then be conjugated to biotin (Sulfo-NHS-LC Biotin) using standard protocols supplied with commercially available kits such as the PIERCE™ kit (PIERCE). Biotinylated peptides are subsequently complexed with avidin in solution and for each peptide complex, 2 rabbits are immunized with 4 doses of antigen (200 ug per dose) in intervals of three weeks between doses. The initial dose is mixed with Freund's Complete adjuvant while subsequent doses are combined with Freund's Immuno-adjuvant. After completion of the immunization, rabbits are test bled and reactivity of sera assayed by dot blot with serial dilutions of the original peptides. If rabbits show significant reactivity compared with pre-immune sera, they are then sacrificed and the blood collected such that immune sera can separated for further experiments.

EXAMPLE 9 Generation of Monoclonal Antibodies Specific for the Tumour Suppressor Proteins

[0155] Monoclonal antibodies can be prepared for any one of the tumour suppressor proteins in the following manner. Immunogen comprising an intact breast cancer protein or peptide (wild type or mutant) is injected in Freund's adjuvant into mice with each mouse receiving four injections of 10 to 100 ug of immunogen. After the fourth injection blood samples taken from the mice are examined for the presence of antibody to the immunogen. Immune mice are sacrificed, their spleens removed and single cell suspensions are prepared (Harlow and Lane, 1988). The spleen cells serve as a source of lymphocytes, which are then fused with a permanently growing myeloma partner cell (Kohler and Milstein, 1975). Cells are plated at a density of 2×10⁵ cells/well in 96 well plates and individual wells are examined for growth. These wells are then tested for the presence of specific antibodies by ELISA or RIA using wild type or mutant breast cancer target protein. Cells in positive wells are expanded and subcloned to establish and confirm monoclonality. Clones with the desired specificity are expanded and grown as ascites in mice followed by purification using affinity chromatography using Protein A Sepharose, ion-exchange chromatography or variations and combinations of these techniques.

INDUSTRIAL APPLICABILITY

[0156] The DNA sequences of the present invention are useful in the diagnosis of cancer, or a pre-disposition thereto. Methods of treatment of cancer and methods for screening for drugs are also made available. TABLE 1 Tumour Suppressor Genes Identified at Chromosome 16q24.3 Associated SEQ ID Gene UniGene Cluster Descripon Numbers BNO60 Hs.278234 PR domain containing 7 protein (PRDM7) 1 to 4 BNO224 None ESTs 5, 6 BNO55 Hs. 122908 CDT1 DNA replication factor 7, 8 BNO36 None ESTs 9, 10 BNO34 Hs.105280 Homo sapiens, clone MGC:14381, mRNA, complete coding sequence 11, 12 BNO208 Hs.54925 Homo sapiens mRNA for KIAA1858 protein; partial coding sequence 13, 14 BNO230 Hs.147733 Homo sapiens cDNA FLJ31875 fis, clone NT2RP7002450 15, 16 BNO229 None Charged multivesicular body protein 1/chromatin modifying protein 1 (CHMP1) 17, 18 BNO228 Hs.194637 BTG3 associated nuclear protein (BANP) 19 to 22 BNO44 Hs.65021 ESTs 23 to 26

REFERENCES

[0157] References cited herein are listed on the following pages, and are incorporated herein by this reference. 3

[0158] Altschulμ, S F. et al. (1997). Nucleic Acids Res. 25: 3389-3402.

[0159] Birot, A. et al. (2000). Gene 253: 189-196.

[0160] Brenner, A J. and Aldaz C M. (1995). Cancer Res. 55: 2892-2895.

[0161] Callen, D F. et al. (1995). Genomics 29: 503-511.

[0162] Chen, T. et al. (1996). Cancer Res. 56: 5605-5609.

[0163] Cleton-Jansen, A-M. et al. (1995). Br. J. Cancer 72: 1241-1244.

[0164] Cole, S P. et al. (1984). Mol. Cell Biol. 62: 109-120.

[0165] Cote, R J. et al. (1983). Proc. Natl. Acad. Sci. USA 80: 2026-2030.

[0166] Culver, K. (1996). Gene Therapy: A Primer for Physicians. Second Edition. (Mary Ann Liebert).

[0167] Devilee, P. et al. (1991). Oncogene 6: 1705-1711.

[0168] Devilee, P. et al. (1994). Genes Chrom. Cancer 11: 71-78.

[0169] Devilee, P. and Cornelisse, C J. (1994). Biochimica et Biophysica Acta 1198: 113-130.

[0170] Doggett, M A. et al. (1995). Nature 377 Suppl: 335-365.

[0171] Elston, C W. and Ellis, I O. (1990). Histopathology 16: 109-118.

[0172] Fearon, E R. and Vogelstein, B. (1990). Cell 61: 759-767.

[0173] Friedman, T. (1991). In Therapy for Genetic Diseases. T Friedman (Ed). Oxford University Press. pp 105-121.

[0174] Futreal, P A. et al. (1994). Science 266: 120-122.

[0175] Groden, J. et al. (1991). Cell 66: 589-600.

[0176] Guardavaccaro, D. et al. (2000). Mol. Cell Biol. 20: 1797-1815.

[0177] Hall, J M. et al. (1990). Science 250: 1684-1689.

[0178] Harlow, E. and Lane, D. (1988). Antibodies: A Laboratory Manual (Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y.).

[0179] Heller, R A. et al. (1997). Proc. Natl. Acad. Sci. USA 94: 2150-2155.

[0180] Huse, W D. et al. (1989). Science 246: 1275-1281.

[0181] Kohler, G. and Milstein, C. (1975). Nature 256: 495-497.

[0182] Kozbor, D. et al. (1985). J. Immunol. Methods 81:31-42.

[0183] Longmire, J L. et al. (1993). GATA 10: 69-76.

[0184] McCormick, M K. et al. (1993). Proc. Natl. Acad. Sci. USA 90: 1063-1067.

[0185] Miki, Y. et al. (1994). Science 266: 66-71.

[0186] Miki, Y. et al. (1996). Nature Genet. 13: 245-247.

[0187] Orlandi, R. et al. (1989). Proc. Natl. Acad. Sci. USA 86: 3833-3837.

[0188] Radford, D M. et al. (1995). Cancer Res. 55: 3399-3405.

[0189] Riethman, H C. et al. (1989). Proc. Natl. Acad. Sci. USA 86: 6240-6244.

[0190] Saito, H. et al. (1993). Cancer Res. 53: 3382-3385.

[0191] Sambrook, J. et al. (1989). Molecular cloning: a laboratory manual. Second Edition. (Cold Spring Harbour Laboratory Press, New York).

[0192] Scharf, D. et al. (1994). Results Probl. Cell Differ. 20: 125-162.

[0193] Schena, M. et al. (1996). Proc. Natl. Acad. Sci. USA 93: 10614-10619.

[0194] Soares, M B. et al. (1994). Proc. Natl. Acad. Sci. USA 91: 9228-9232.

[0195] Weber, J L. and May, P E. (1989). Am. J. Hum. Genet. 44: 388-396.

[0196] Whitmore, S A. et al. (1994). Genomics 20: 169-175.

[0197] Whitmore, S A. et al. (1998). Genomics 50: 1-8.

[0198] WHO. (1981). Histological Typing of Breast Tumours. Second Edition. (Geneva).

[0199] Winter, G. et al. (1991). Nature 349: 293-299.

[0200] Wong, J C. et al. (2000). Genomics 67: 273-283.

[0201] Wooster, R. et al. (1995). Nature 378: 789-791.

[0202] Wooster, R. et al. (1994). Science 265: 2088-2090.

[0203] Wyman, A. and White, R. (1980). Proc. Natl. Acad. Sci. USA 77: 6754-6758.

[0204] Yoshida, Y. et al. (1998). Oncogene 16: 2687-2693.

1 26 1 2635 DNA Homo sapiens CDS (142)..(1362) 1 atcattccct tactcgaatg aattaaagta caggattggg gctaaataat gtgagtgcca 60 cgctctttct gaagctctta tatcaaggaa catgcattac aactttccta atccctgctt 120 ccctcacttc cagattgtga g atg tgt cag aac ttc ttc att gac agc tgt 171 Met Cys Gln Asn Phe Phe Ile Asp Ser Cys 1 5 10 gct gct cat ggg ccc cct aca ttt gta aag gac agt gca gtg gac aag 219 Ala Ala His Gly Pro Pro Thr Phe Val Lys Asp Ser Ala Val Asp Lys 15 20 25 ggg cat ccc aac cgt tca gcc ctc agt ctg ccc ccg ggg ctg aga att 267 Gly His Pro Asn Arg Ser Ala Leu Ser Leu Pro Pro Gly Leu Arg Ile 30 35 40 ggg cca tca ggc atc cct cag gct ggg ctt gga gta tgg aac gag gca 315 Gly Pro Ser Gly Ile Pro Gln Ala Gly Leu Gly Val Trp Asn Glu Ala 45 50 55 tct gat ctg cca ctg ggt ctg cac ttt ggc ccc tat gag ggc cga att 363 Ser Asp Leu Pro Leu Gly Leu His Phe Gly Pro Tyr Glu Gly Arg Ile 60 65 70 aca gaa gac gaa gag gca gcc aac agt gga tat tcc tgg cta atc acc 411 Thr Glu Asp Glu Glu Ala Ala Asn Ser Gly Tyr Ser Trp Leu Ile Thr 75 80 85 90 aag ggg aga aac tgc tat gag tat gtg gat gga aaa gat aaa tcc tcg 459 Lys Gly Arg Asn Cys Tyr Glu Tyr Val Asp Gly Lys Asp Lys Ser Ser 95 100 105 gcc aac tgg atg agg tat gtg aac tgt gcc cgg gat gat gaa gag cag 507 Ala Asn Trp Met Arg Tyr Val Asn Cys Ala Arg Asp Asp Glu Glu Gln 110 115 120 aac ctg gtg gcc ttc cag tac cac agg cag atc ttc tat aga acc tgc 555 Asn Leu Val Ala Phe Gln Tyr His Arg Gln Ile Phe Tyr Arg Thr Cys 125 130 135 cga gtc att agg cca ggc tgt gaa ctg ctg gtc tgg tat ggg gat gag 603 Arg Val Ile Arg Pro Gly Cys Glu Leu Leu Val Trp Tyr Gly Asp Glu 140 145 150 tat ggc cag gaa ctg ggc atc aag tgg ggc agc aag tgg aag aaa gag 651 Tyr Gly Gln Glu Leu Gly Ile Lys Trp Gly Ser Lys Trp Lys Lys Glu 155 160 165 170 ctc atg gca ggg aga gaa cca aag cca gag atc cat cca tgt ccc tca 699 Leu Met Ala Gly Arg Glu Pro Lys Pro Glu Ile His Pro Cys Pro Ser 175 180 185 tgc tgt ctg gcc ttt tca agt caa aaa ttt ctc agt caa cat gtg gaa 747 Cys Cys Leu Ala Phe Ser Ser Gln Lys Phe Leu Ser Gln His Val Glu 190 195 200 cgc aat cac tcc tct cag aac ttc cca gga cca tct gca aga aaa ctt 795 Arg Asn His Ser Ser Gln Asn Phe Pro Gly Pro Ser Ala Arg Lys Leu 205 210 215 ctc caa cca gag aat ccc tgc cca ggg gat cag aat cag gag cgg caa 843 Leu Gln Pro Glu Asn Pro Cys Pro Gly Asp Gln Asn Gln Glu Arg Gln 220 225 230 tat tct gat cca cgc tgc tgt aat gac aaa acc aaa ggt caa gag atc 891 Tyr Ser Asp Pro Arg Cys Cys Asn Asp Lys Thr Lys Gly Gln Glu Ile 235 240 245 250 aaa gaa agg tcc aaa ctc ttg aat aaa agg aca tgg cag agg gag att 939 Lys Glu Arg Ser Lys Leu Leu Asn Lys Arg Thr Trp Gln Arg Glu Ile 255 260 265 tca agg gcc ttt tct agc cca ccc aaa gga caa atg ggg agc tct aga 987 Ser Arg Ala Phe Ser Ser Pro Pro Lys Gly Gln Met Gly Ser Ser Arg 270 275 280 gtg gga gaa aga atg atg gaa gaa gag tcc aga aca ggc cag aaa gtg 1035 Val Gly Glu Arg Met Met Glu Glu Glu Ser Arg Thr Gly Gln Lys Val 285 290 295 aat cca ggg aac aca ggc aaa tta ttt gtg ggg gta gga atc tca aga 1083 Asn Pro Gly Asn Thr Gly Lys Leu Phe Val Gly Val Gly Ile Ser Arg 300 305 310 att gcg aaa gtc aaa tat gga gag tgt ggg caa ggt ttc agt gat aag 1131 Ile Ala Lys Val Lys Tyr Gly Glu Cys Gly Gln Gly Phe Ser Asp Lys 315 320 325 330 tca gat gtt att aca cac caa agg aca cac aca ggg ggg aag ccc tac 1179 Ser Asp Val Ile Thr His Gln Arg Thr His Thr Gly Gly Lys Pro Tyr 335 340 345 gtc tgc aga gag tgt ggg gag ggc ttt agc cgg aag tca gac ctc ctc 1227 Val Cys Arg Glu Cys Gly Glu Gly Phe Ser Arg Lys Ser Asp Leu Leu 350 355 360 agt cac cag agg aca cac aca ggg gag aag cct tat gtc tgc aga gag 1275 Ser His Gln Arg Thr His Thr Gly Glu Lys Pro Tyr Val Cys Arg Glu 365 370 375 tgt gag cgg ggc ttt agc cgg aag tca gtc ctc ctc att cac cag agg 1323 Cys Glu Arg Gly Phe Ser Arg Lys Ser Val Leu Leu Ile His Gln Arg 380 385 390 aca cac agg gga gaa gcc cca gtc tgc agg aag gat gag taagtcatta 1372 Thr His Arg Gly Glu Ala Pro Val Cys Arg Lys Asp Glu 395 400 405 gtaataaaac cttatctcaa tagccacaag aagacaaacg tgatcaccac acacttgcac 1432 accccagctc tgaggtggct tcagcgaaag tctgctaacc ccttacattc cccgagagtg 1492 taaagagatc ggaaataact aattaaacaa atccgccact ttcatgacta gagttgagga 1552 agaacagggg atagttctgt aagtgttcgg gggacgtcaa catgtgtggt tgtttcccgc 1612 actgatcccc tccatttttt gtgttttgcc tcctgttcta attaattttg tctccataca 1672 tatctgaacc ccaagtgtgt acctcattct tcccttatca ctgaaggaag gaagagtcca 1732 gaagggccac agagaactca aacgttcagt tcaagtctcc acaggaattc aaccccagaa 1792 agacataaac ttggagtcca tctggtttaa ttattggaga atcgattccc aagtccagga 1852 agagaaatgt agggttttac agagtcgcag caggaaagag agctccctgg tctcctggga 1912 agtgtgacct cttctaatgg acccctctcc tctgctgcca tactccccct tggctccccc 1972 tgtctcctct cctgatttcc tccaatctct gtagccccag aagtgaacgc cagacaggaa 2032 cacgcatgtg tgtatatatg tgttcacgtg tgctatgtgt gttaagcctg catgcatggg 2092 tgtgggggta tgtgccctct gtgtacgtat ctgtgtgagt gtgggggttt caagggtgta 2152 ttaggaataa cgctcaaaat cctaaggaaa ttgaatactc tgagagaaga gagacagacc 2212 ctctcatact gttttatatt gttttatact cagaaaagga aaaagaagca aaactaaagg 2272 caggtagcct ggcgcctagg aaccagacct gaaaccaagg aaccagaccc gaaaccaggc 2332 ctgggccggc ctgacctaag cctggtagtt aaaattcgac ccctgaccta gcaactgatg 2392 ttatctatag attatagaaa gacattgtga aacttcccgg tctgttctgt tccactctga 2452 ccatcggtgc atgcagcccc tgtcacctac cccctgcttg ctcaatcgat cacgaccctc 2512 tcacgtggac ccccttagag ttgtgagccc ttaaaaggga caggaattgc tcactcgggg 2572 agctcggctc ttgagacagc agtcttgctg atgctcctgg ccgaataaac cgcttccttc 2632 ttt 2635 2 407 PRT Homo sapiens 2 Met Cys Gln Asn Phe Phe Ile Asp Ser Cys Ala Ala His Gly Pro Pro 1 5 10 15 Thr Phe Val Lys Asp Ser Ala Val Asp Lys Gly His Pro Asn Arg Ser 20 25 30 Ala Leu Ser Leu Pro Pro Gly Leu Arg Ile Gly Pro Ser Gly Ile Pro 35 40 45 Gln Ala Gly Leu Gly Val Trp Asn Glu Ala Ser Asp Leu Pro Leu Gly 50 55 60 Leu His Phe Gly Pro Tyr Glu Gly Arg Ile Thr Glu Asp Glu Glu Ala 65 70 75 80 Ala Asn Ser Gly Tyr Ser Trp Leu Ile Thr Lys Gly Arg Asn Cys Tyr 85 90 95 Glu Tyr Val Asp Gly Lys Asp Lys Ser Ser Ala Asn Trp Met Arg Tyr 100 105 110 Val Asn Cys Ala Arg Asp Asp Glu Glu Gln Asn Leu Val Ala Phe Gln 115 120 125 Tyr His Arg Gln Ile Phe Tyr Arg Thr Cys Arg Val Ile Arg Pro Gly 130 135 140 Cys Glu Leu Leu Val Trp Tyr Gly Asp Glu Tyr Gly Gln Glu Leu Gly 145 150 155 160 Ile Lys Trp Gly Ser Lys Trp Lys Lys Glu Leu Met Ala Gly Arg Glu 165 170 175 Pro Lys Pro Glu Ile His Pro Cys Pro Ser Cys Cys Leu Ala Phe Ser 180 185 190 Ser Gln Lys Phe Leu Ser Gln His Val Glu Arg Asn His Ser Ser Gln 195 200 205 Asn Phe Pro Gly Pro Ser Ala Arg Lys Leu Leu Gln Pro Glu Asn Pro 210 215 220 Cys Pro Gly Asp Gln Asn Gln Glu Arg Gln Tyr Ser Asp Pro Arg Cys 225 230 235 240 Cys Asn Asp Lys Thr Lys Gly Gln Glu Ile Lys Glu Arg Ser Lys Leu 245 250 255 Leu Asn Lys Arg Thr Trp Gln Arg Glu Ile Ser Arg Ala Phe Ser Ser 260 265 270 Pro Pro Lys Gly Gln Met Gly Ser Ser Arg Val Gly Glu Arg Met Met 275 280 285 Glu Glu Glu Ser Arg Thr Gly Gln Lys Val Asn Pro Gly Asn Thr Gly 290 295 300 Lys Leu Phe Val Gly Val Gly Ile Ser Arg Ile Ala Lys Val Lys Tyr 305 310 315 320 Gly Glu Cys Gly Gln Gly Phe Ser Asp Lys Ser Asp Val Ile Thr His 325 330 335 Gln Arg Thr His Thr Gly Gly Lys Pro Tyr Val Cys Arg Glu Cys Gly 340 345 350 Glu Gly Phe Ser Arg Lys Ser Asp Leu Leu Ser His Gln Arg Thr His 355 360 365 Thr Gly Glu Lys Pro Tyr Val Cys Arg Glu Cys Glu Arg Gly Phe Ser 370 375 380 Arg Lys Ser Val Leu Leu Ile His Gln Arg Thr His Arg Gly Glu Ala 385 390 395 400 Pro Val Cys Arg Lys Asp Glu 405 3 2440 DNA Homo sapiens CDS (142)..(657) 3 atcattccct tactcgaatg aattaaagta caggattggg gctaaataat gtgagtgcca 60 cgctctttct gaagctctta tatcaaggaa catgcattac aactttccta atccctgctt 120 ccctcacttc cagattgtga g atg tgt cag aac ttc ttc att gac agc tgt 171 Met Cys Gln Asn Phe Phe Ile Asp Ser Cys 1 5 10 gct gct cat ggg ccc cct aca ttt gta aag gac agt gca gtg gac aag 219 Ala Ala His Gly Pro Pro Thr Phe Val Lys Asp Ser Ala Val Asp Lys 15 20 25 ggg cat ccc aac cgt tca gcc ctc agt ctg ccc ccg ggg ctg aga att 267 Gly His Pro Asn Arg Ser Ala Leu Ser Leu Pro Pro Gly Leu Arg Ile 30 35 40 ggg cca tca ggc atc cct cag gct ggg ctt gga gta tgg aac gag gca 315 Gly Pro Ser Gly Ile Pro Gln Ala Gly Leu Gly Val Trp Asn Glu Ala 45 50 55 tct gat ctg cca ctg ggt ctg cac ttt ggc ccc tat gag ggc cga att 363 Ser Asp Leu Pro Leu Gly Leu His Phe Gly Pro Tyr Glu Gly Arg Ile 60 65 70 aca gaa gac gaa gag gca gcc aac agt gga tat tcc tgg cta atc acc 411 Thr Glu Asp Glu Glu Ala Ala Asn Ser Gly Tyr Ser Trp Leu Ile Thr 75 80 85 90 aag ggg aga aac tgc tat gag tat gtg gat gga aaa gat aaa tcc tcg 459 Lys Gly Arg Asn Cys Tyr Glu Tyr Val Asp Gly Lys Asp Lys Ser Ser 95 100 105 gcc aac tgg atg aga acc aaa gcc aga gat cca tcc atg tcc ctc atg 507 Ala Asn Trp Met Arg Thr Lys Ala Arg Asp Pro Ser Met Ser Leu Met 110 115 120 ctg tct ggc ctt ttc aag tca aaa att tct cag tca aca tgt gga acg 555 Leu Ser Gly Leu Phe Lys Ser Lys Ile Ser Gln Ser Thr Cys Gly Thr 125 130 135 caa tca ctc ctc tca gaa ctt ccc agg acc atc tgc aag aaa act tct 603 Gln Ser Leu Leu Ser Glu Leu Pro Arg Thr Ile Cys Lys Lys Thr Ser 140 145 150 cca acc aga gaa tcc ctg ccc agg gga tca gaa tca gga gcg gca ata 651 Pro Thr Arg Glu Ser Leu Pro Arg Gly Ser Glu Ser Gly Ala Ala Ile 155 160 165 170 ttc tga tccacgctgc tgtaatgaca aaaccaaagg tcaagagatc aaagaaaggt 707 Phe ccaaactctt gaataaaagg acatggcaga gggagatttc aagggccttt tctagcccac 767 ccaaaggaca aatggggagc tctagagtgg gagaaagaat gatggaagaa gagtccagaa 827 caggccagaa agtgaatcca gggaacacag gcaaattatt tgtgggggta ggaatctcaa 887 gaattgcgaa agtcaaatat ggagagtgtg ggcaaggttt cagtgataag tcagatgtta 947 ttacacacca aaggacacac acagggggga agccctacgt ctgcagagag tgtgggaggg 1007 ctttagccgg aagtcagacc tcctcagtca ccagaggaca cacacagggg agaagcctta 1067 tgtctgcaga gagtgtgagc ggggctttag ccggaagtca gtcctcctca ttcaccagag 1127 gacacacagg ggagaagccc cagtctgcag gaaggatgag taagtcatta gtaataaaac 1187 cttatctcaa tagccacaag aagacaaacg tgatcaccac acacttgcac accccagctc 1247 tgaggtggct tcagcgaaag tctgctaacc ccttacattc cccgagagtg taaagagatc 1307 ggaaataact aattaaacaa atccgccact ttcatgacta gagttgagga agaacagggg 1367 atagttctgt aagtgttcgg gggacgtcaa catgtgtggt tgtttcccgc actgatcccc 1427 tccatttttt gtgttttgcc tcctgttcta attaattttg tctccataca tatctgaacc 1487 ccaagtgtgt acctcattct tcccttatca ctgaaggaag gaagagtcca gaagggccac 1547 agagaactca aacgttcagt tcaagtctcc acaggaattc aaccccagaa agacataaac 1607 ttggagtcca tctggtttaa ttattggaga atcgattccc aagtccagga agagaaatgt 1667 agggttttac agagtcgcag caggaaagag agctccctgg tctcctggga agtgtgacct 1727 cttctaatgg acccctctcc tctgctgcca tactccccct tggctccccc tgtctcctct 1787 cctgatttcc tccaatctct gtagccccag aagtgaacgc cagacaggaa cacgcatgtg 1847 tgtatatatg tgttcacgtg tgctatgtgt gttaagcctg catgcatggg tgtgggggta 1907 tgtgccctct gtgtacgtat ctgtgtgagt gtgggggttt caagggtgta ttaggaataa 1967 cgctcaaaat cctaaggaaa ttgaatactc tgagagaaga gagacagacc ctctcatact 2027 gttttatatt gttttatact cagaaaagga aaaagaagca aaactaaagg caggtagcct 2087 ggcgcctagg aaccagacct gaaaccaagg aaccagaccc gaaaccaggc ctgggccggc 2147 ctgacctaag cctggtagtt aaaattcgac ccctgaccta gcaactgatg ttatctatag 2207 attatagaaa gacattgtga aacttcccgg tctgttctgt tccactctga ccatcggtgc 2267 atgcagcccc tgtcacctac cccctgcttg ctcaatcgat cacgaccctc tcacgtggac 2327 ccccttagag ttgtgagccc ttaaaaggga caggaattgc tcactcgggg agctcggctc 2387 ttgagacagc agtcttgctg atgctcctgg ccgaataaac cgcttccttc ttt 2440 4 171 PRT Homo sapiens 4 Met Cys Gln Asn Phe Phe Ile Asp Ser Cys Ala Ala His Gly Pro Pro 1 5 10 15 Thr Phe Val Lys Asp Ser Ala Val Asp Lys Gly His Pro Asn Arg Ser 20 25 30 Ala Leu Ser Leu Pro Pro Gly Leu Arg Ile Gly Pro Ser Gly Ile Pro 35 40 45 Gln Ala Gly Leu Gly Val Trp Asn Glu Ala Ser Asp Leu Pro Leu Gly 50 55 60 Leu His Phe Gly Pro Tyr Glu Gly Arg Ile Thr Glu Asp Glu Glu Ala 65 70 75 80 Ala Asn Ser Gly Tyr Ser Trp Leu Ile Thr Lys Gly Arg Asn Cys Tyr 85 90 95 Glu Tyr Val Asp Gly Lys Asp Lys Ser Ser Ala Asn Trp Met Arg Thr 100 105 110 Lys Ala Arg Asp Pro Ser Met Ser Leu Met Leu Ser Gly Leu Phe Lys 115 120 125 Ser Lys Ile Ser Gln Ser Thr Cys Gly Thr Gln Ser Leu Leu Ser Glu 130 135 140 Leu Pro Arg Thr Ile Cys Lys Lys Thr Ser Pro Thr Arg Glu Ser Leu 145 150 155 160 Pro Arg Gly Ser Glu Ser Gly Ala Ala Ile Phe 165 170 5 2205 DNA Homo sapiens CDS (679)..(1881) 5 acgccgggtc ctctaggaac ctcgggccgg gcagcacccg cgggattctg ctggcgtcct 60 ccgctgccat gaagcgggac cggctgggcc gcttcctgtc tcctgggtcg tcccgacagt 120 gcggggcctc ggacggcggc ggcggcgtca gccggactcg gggccgccct tcccttagcg 180 gtgggccgag ggtggacggg gcgacggcgc ggcgcgcctg gggcccggtg gggtcctgcg 240 gggacgcggg cgaggacggc gcggacgagg caggagcagg ccgggctctc gccatgggtc 300 actgtcgcct ctgccacggg aagttttcct cgagaagcct gcgcagcatc tccgagaggg 360 cgcctggagc gagcatggag aggccatccg cagaggagcg cgtgctcgta cgggacttcc 420 agcgcctgct tggtgtggct gtccgccagg accccacctt gtctccgttt gtctgcaaga 480 gctgccacgc ccagttctac cagtgccaca gccttctcaa gtccttcctg cagagggtca 540 acgcctcccc ggctggtcgc cggaagcctt gtgcaaaggt cggtgcccag cccccaacag 600 gggcagagga gggagcgtgt ctggtggatc tgatcacatc cagcccccag tgcctgcacg 660 gcttggtggg gtgggtgc atg gac atg cgg cca gct gcg ggg ccc tgc ccc 711 Met Asp Met Arg Pro Ala Ala Gly Pro Cys Pro 1 5 10 acc ttc aga gga cac tgt cct ccg agt act gcg gcg tca tcc agg tcg 759 Thr Phe Arg Gly His Cys Pro Pro Ser Thr Ala Ala Ser Ser Arg Ser 15 20 25 tgt ggg gct gcg acc agg gcc acg act aca cca tgg ata cca gct cca 807 Cys Gly Ala Ala Thr Arg Ala Thr Thr Thr Pro Trp Ile Pro Ala Pro 30 35 40 gct gca agg cct tct tgc tgg aca gtg cgc tgg cag tca agt ggc cat 855 Ala Ala Arg Pro Ser Cys Trp Thr Val Arg Trp Gln Ser Ser Gly His 45 50 55 ggg aca aag aga cgg cgc cac ggc tgc ccc agc acc gag ggt gga acc 903 Gly Thr Lys Arg Arg Arg His Gly Cys Pro Ser Thr Glu Gly Gly Thr 60 65 70 75 ctg ggg atg ccc ctc aga cct ccc agg gta gag gga cag gga ccc cag 951 Leu Gly Met Pro Leu Arg Pro Pro Arg Val Glu Gly Gln Gly Pro Gln 80 85 90 ttg ggg ctg aga cca aga ccc tgc cca gca cgg atg tgg ccc agc ctc 999 Leu Gly Leu Arg Pro Arg Pro Cys Pro Ala Arg Met Trp Pro Ser Leu 95 100 105 ctt cgg aca gcg acg cgg tgg ggc cca ggt cgg gct tcc cac ctc agc 1047 Leu Arg Thr Ala Thr Arg Trp Gly Pro Gly Arg Ala Ser His Leu Ser 110 115 120 caa gcc tgc ccc ttt gca ggg ccc cag ggc agt tgg gtg aga agc agc 1095 Gln Ala Cys Pro Phe Ala Gly Pro Gln Gly Ser Trp Val Arg Ser Ser 125 130 135 ttc cat ctt caa cct cgg atg atc gga gac gtc ttg agt gaa gat gaa 1143 Phe His Leu Gln Pro Arg Met Ile Gly Asp Val Leu Ser Glu Asp Glu 140 145 150 155 aat gac aag aag caa aat gcc cag tct tcg gac gag tcc ttt gag cct 1191 Asn Asp Lys Lys Gln Asn Ala Gln Ser Ser Asp Glu Ser Phe Glu Pro 160 165 170 tac cca gaa agg aaa gtc tct ggt aag aag agt gaa agc aaa gaa gcc 1239 Tyr Pro Glu Arg Lys Val Ser Gly Lys Lys Ser Glu Ser Lys Glu Ala 175 180 185 aag aag tct gaa gaa cca aga att cgg aag aag ccg gga ccc aag ccc 1287 Lys Lys Ser Glu Glu Pro Arg Ile Arg Lys Lys Pro Gly Pro Lys Pro 190 195 200 gga tgg aag aag aag ctt cgt tgt gag agg gag gag ctt ccc acc atc 1335 Gly Trp Lys Lys Lys Leu Arg Cys Glu Arg Glu Glu Leu Pro Thr Ile 205 210 215 tac aag tgt cct tac cag ggc tgc acg gcc gtg tac cga ggc gct gac 1383 Tyr Lys Cys Pro Tyr Gln Gly Cys Thr Ala Val Tyr Arg Gly Ala Asp 220 225 230 235 ggc atg aag aag cac atc aag gag cac cac gag gag gtc cgg gag cgg 1431 Gly Met Lys Lys His Ile Lys Glu His His Glu Glu Val Arg Glu Arg 240 245 250 ccc tgc ccc cac cct ggc tgc aac aag gtt ttc atg atc gac cgc tac 1479 Pro Cys Pro His Pro Gly Cys Asn Lys Val Phe Met Ile Asp Arg Tyr 255 260 265 ctg cag cgc cac gtg aag ctc atc cac aca gag gtg cgg aac tat atc 1527 Leu Gln Arg His Val Lys Leu Ile His Thr Glu Val Arg Asn Tyr Ile 270 275 280 tgt gac gaa tgt gga caa acc ttc aag cag cgg aag cac ctt ctc gtc 1575 Cys Asp Glu Cys Gly Gln Thr Phe Lys Gln Arg Lys His Leu Leu Val 285 290 295 cac caa atg cga cat tcg gga gcc aag cct ttg cag tgt gag gtc tgt 1623 His Gln Met Arg His Ser Gly Ala Lys Pro Leu Gln Cys Glu Val Cys 300 305 310 315 ggg ttc cag tgc agg cag cgg gca tcc ctc aag tac cac atg acc aaa 1671 Gly Phe Gln Cys Arg Gln Arg Ala Ser Leu Lys Tyr His Met Thr Lys 320 325 330 cac aag gct gag act gag ctg gac ttt gcc tgt gac cag tgt ggc cgg 1719 His Lys Ala Glu Thr Glu Leu Asp Phe Ala Cys Asp Gln Cys Gly Arg 335 340 345 cgg ttt gag aag gcc cac aac ctc aat gta cac atg tcc atg gtg cac 1767 Arg Phe Glu Lys Ala His Asn Leu Asn Val His Met Ser Met Val His 350 355 360 ccg ctg aca cag acc cag gac aag gcc ctg ccc ctg gag gcg gaa cca 1815 Pro Leu Thr Gln Thr Gln Asp Lys Ala Leu Pro Leu Glu Ala Glu Pro 365 370 375 cca cct ggg cca ccg agc ccc tct gtg acc aca gag ggc cag gcg gtg 1863 Pro Pro Gly Pro Pro Ser Pro Ser Val Thr Thr Glu Gly Gln Ala Val 380 385 390 395 aag ccc gaa ccc acc tga ggacggcagt gaggatgagc acctctagca 1911 Lys Pro Glu Pro Thr 400 gcctggactc cgcagtggct gtgtcagcct cacccttcgt gtgcacccgc atgggagggt 1971 cggagggtgc tgcccgccct tggtgctgga ggcgggcttg gtgtccggct caagtagcct 2031 tcctctgctc tgggaccagt ggtttatttt cccgcaaacg ctgagtgact cggggccgga 2091 cagttcataa ataattgatt cctttcccca ctaaagcagt cgaggagatt tgtaatccac 2151 tttttagtgc aacaagagct ccatgttatg cttgtaataa attatttaca cggg 2205 6 400 PRT Homo sapiens 6 Met Asp Met Arg Pro Ala Ala Gly Pro Cys Pro Thr Phe Arg Gly His 1 5 10 15 Cys Pro Pro Ser Thr Ala Ala Ser Ser Arg Ser Cys Gly Ala Ala Thr 20 25 30 Arg Ala Thr Thr Thr Pro Trp Ile Pro Ala Pro Ala Ala Arg Pro Ser 35 40 45 Cys Trp Thr Val Arg Trp Gln Ser Ser Gly His Gly Thr Lys Arg Arg 50 55 60 Arg His Gly Cys Pro Ser Thr Glu Gly Gly Thr Leu Gly Met Pro Leu 65 70 75 80 Arg Pro Pro Arg Val Glu Gly Gln Gly Pro Gln Leu Gly Leu Arg Pro 85 90 95 Arg Pro Cys Pro Ala Arg Met Trp Pro Ser Leu Leu Arg Thr Ala Thr 100 105 110 Arg Trp Gly Pro Gly Arg Ala Ser His Leu Ser Gln Ala Cys Pro Phe 115 120 125 Ala Gly Pro Gln Gly Ser Trp Val Arg Ser Ser Phe His Leu Gln Pro 130 135 140 Arg Met Ile Gly Asp Val Leu Ser Glu Asp Glu Asn Asp Lys Lys Gln 145 150 155 160 Asn Ala Gln Ser Ser Asp Glu Ser Phe Glu Pro Tyr Pro Glu Arg Lys 165 170 175 Val Ser Gly Lys Lys Ser Glu Ser Lys Glu Ala Lys Lys Ser Glu Glu 180 185 190 Pro Arg Ile Arg Lys Lys Pro Gly Pro Lys Pro Gly Trp Lys Lys Lys 195 200 205 Leu Arg Cys Glu Arg Glu Glu Leu Pro Thr Ile Tyr Lys Cys Pro Tyr 210 215 220 Gln Gly Cys Thr Ala Val Tyr Arg Gly Ala Asp Gly Met Lys Lys His 225 230 235 240 Ile Lys Glu His His Glu Glu Val Arg Glu Arg Pro Cys Pro His Pro 245 250 255 Gly Cys Asn Lys Val Phe Met Ile Asp Arg Tyr Leu Gln Arg His Val 260 265 270 Lys Leu Ile His Thr Glu Val Arg Asn Tyr Ile Cys Asp Glu Cys Gly 275 280 285 Gln Thr Phe Lys Gln Arg Lys His Leu Leu Val His Gln Met Arg His 290 295 300 Ser Gly Ala Lys Pro Leu Gln Cys Glu Val Cys Gly Phe Gln Cys Arg 305 310 315 320 Gln Arg Ala Ser Leu Lys Tyr His Met Thr Lys His Lys Ala Glu Thr 325 330 335 Glu Leu Asp Phe Ala Cys Asp Gln Cys Gly Arg Arg Phe Glu Lys Ala 340 345 350 His Asn Leu Asn Val His Met Ser Met Val His Pro Leu Thr Gln Thr 355 360 365 Gln Asp Lys Ala Leu Pro Leu Glu Ala Glu Pro Pro Pro Gly Pro Pro 370 375 380 Ser Pro Ser Val Thr Thr Glu Gly Gln Ala Val Lys Pro Glu Pro Thr 385 390 395 400 7 1898 DNA Homo sapiens CDS (26)..(1666) 7 acgcgtccgc gcgcactccg ccgcc atg gag cag cgc cgc gtc acc gac ttc 52 Met Glu Gln Arg Arg Val Thr Asp Phe 1 5 ttc gcg cgc cgc cgc ccc ggg ccc ccc cgc atc gcg ccg ccc aag ctg 100 Phe Ala Arg Arg Arg Pro Gly Pro Pro Arg Ile Ala Pro Pro Lys Leu 10 15 20 25 gcc tgc cgc acc ccc agc ccc gcc agg ccc gca ctc cgc gcc ccg gcc 148 Ala Cys Arg Thr Pro Ser Pro Ala Arg Pro Ala Leu Arg Ala Pro Ala 30 35 40 tcc gct acc agt ggc agc cgc aag cgc gcc cgc ccg ccc gcc gcc ccc 196 Ser Ala Thr Ser Gly Ser Arg Lys Arg Ala Arg Pro Pro Ala Ala Pro 45 50 55 gga cgc gac cag gcc agg cca ccg gcc cgc agg aga ctg cgg ctg tcg 244 Gly Arg Asp Gln Ala Arg Pro Pro Ala Arg Arg Arg Leu Arg Leu Ser 60 65 70 gtg gac gag gtt tcc agc ccc agt acc ccc gag gcc cca gac atc cca 292 Val Asp Glu Val Ser Ser Pro Ser Thr Pro Glu Ala Pro Asp Ile Pro 75 80 85 gcc tgc cct tct ccg ggc cag aag ata aag aaa tcc acc ccg gca gca 340 Ala Cys Pro Ser Pro Gly Gln Lys Ile Lys Lys Ser Thr Pro Ala Ala 90 95 100 105 ggt cag ccg ccc cac ctg aca tcc gcg cag gac cag gac acc atc tct 388 Gly Gln Pro Pro His Leu Thr Ser Ala Gln Asp Gln Asp Thr Ile Ser 110 115 120 gag ctt gcg tca tgc ctg caa cgg gcc cgg gag ctg ggg gca aga gtc 436 Glu Leu Ala Ser Cys Leu Gln Arg Ala Arg Glu Leu Gly Ala Arg Val 125 130 135 cgg gcg ctg aag gcc agt gcc cag gat gct ggg gag tcc tgc acc cca 484 Arg Ala Leu Lys Ala Ser Ala Gln Asp Ala Gly Glu Ser Cys Thr Pro 140 145 150 gag gcc gag ggc cgc cct gag gag cca tgt ggc gag aag gcg ccc gcc 532 Glu Ala Glu Gly Arg Pro Glu Glu Pro Cys Gly Glu Lys Ala Pro Ala 155 160 165 tac cag cgc ttc cat gcc ctg gcc cag ccc ggc ctg ccg gga ctc gtg 580 Tyr Gln Arg Phe His Ala Leu Ala Gln Pro Gly Leu Pro Gly Leu Val 170 175 180 185 ctg ccc tac aag tac cag gtg ctg gcg gag atg ttc cgc agc atg gac 628 Leu Pro Tyr Lys Tyr Gln Val Leu Ala Glu Met Phe Arg Ser Met Asp 190 195 200 acc atc gtg ggc atg ctc cac aac cgc tcc gag acg ccc acc ttt gcc 676 Thr Ile Val Gly Met Leu His Asn Arg Ser Glu Thr Pro Thr Phe Ala 205 210 215 aag gtc cag cgg ggc gtc cag gac atg atg cgt agg cgt ttt gag gag 724 Lys Val Gln Arg Gly Val Gln Asp Met Met Arg Arg Arg Phe Glu Glu 220 225 230 cgc aat gtt ggc cag atc aaa acc gtg tac ccg gcc tcc tac cgc ttc 772 Arg Asn Val Gly Gln Ile Lys Thr Val Tyr Pro Ala Ser Tyr Arg Phe 235 240 245 cgc cag gag cgc agt gtc ccc acc ttc aag gat ggc gcc agg agg tca 820 Arg Gln Glu Arg Ser Val Pro Thr Phe Lys Asp Gly Ala Arg Arg Ser 250 255 260 265 gat tac cag ctc acc atc gag cca ctg ctg gag cag gag gct gac gga 868 Asp Tyr Gln Leu Thr Ile Glu Pro Leu Leu Glu Gln Glu Ala Asp Gly 270 275 280 gca gcc ccc cag ctc acg gcc tcg cgc ctc ctg cag cga cgg cag atc 916 Ala Ala Pro Gln Leu Thr Ala Ser Arg Leu Leu Gln Arg Arg Gln Ile 285 290 295 ttc agc cag aag ctg gtg gag cac gtc aag gag cac cac aag gcc ttc 964 Phe Ser Gln Lys Leu Val Glu His Val Lys Glu His His Lys Ala Phe 300 305 310 ctg gcc tcc ctg agc ccc gcc atg gtg gtg ccg gag gac cag ctg acc 1012 Leu Ala Ser Leu Ser Pro Ala Met Val Val Pro Glu Asp Gln Leu Thr 315 320 325 cgc tgg cac ccg cgc ttc aac gtg gat gaa gta ccc gac atc gag ccg 1060 Arg Trp His Pro Arg Phe Asn Val Asp Glu Val Pro Asp Ile Glu Pro 330 335 340 345 gcc gcg ctg ccc cag cca ccc gcc acg gag aag ctc acc act gct cag 1108 Ala Ala Leu Pro Gln Pro Pro Ala Thr Glu Lys Leu Thr Thr Ala Gln 350 355 360 gag gtg ctg gcc cgg gcc cgc aac ctg att tca ccc agg atg gag aag 1156 Glu Val Leu Ala Arg Ala Arg Asn Leu Ile Ser Pro Arg Met Glu Lys 365 370 375 gcc ttg agt caa ttg gcc ctg cgc tct gct gcg ccc agc agc ccc ggg 1204 Ala Leu Ser Gln Leu Ala Leu Arg Ser Ala Ala Pro Ser Ser Pro Gly 380 385 390 tct ccc agg cca gca ctg ccg gct acc cca cca gcc acc ccg cct gca 1252 Ser Pro Arg Pro Ala Leu Pro Ala Thr Pro Pro Ala Thr Pro Pro Ala 395 400 405 gcc tct ccc agt gct ctg aag ggg gtg tcc cag gat ctg ctg gag cgg 1300 Ala Ser Pro Ser Ala Leu Lys Gly Val Ser Gln Asp Leu Leu Glu Arg 410 415 420 425 atc cga gcc aag gag gca cag aag cag ctg gca cag atg acg cgg tgc 1348 Ile Arg Ala Lys Glu Ala Gln Lys Gln Leu Ala Gln Met Thr Arg Cys 430 435 440 ccg gag cag gag cag cgg ctg cag cgc tta gaa cgg ctg cct gag ctg 1396 Pro Glu Gln Glu Gln Arg Leu Gln Arg Leu Glu Arg Leu Pro Glu Leu 445 450 455 gcc cgc gtg ctg cgg agc gtc ttt gtg tcc gaa cgc aag cct gcg ctc 1444 Ala Arg Val Leu Arg Ser Val Phe Val Ser Glu Arg Lys Pro Ala Leu 460 465 470 agc atg gag gtg gcc tgt gcc agg atg gtg ggc agc tgt tgt act atc 1492 Ser Met Glu Val Ala Cys Ala Arg Met Val Gly Ser Cys Cys Thr Ile 475 480 485 atg agc cct ggg gaa atg gag aag cac ctg ctg ctc ctc tcc gag ctg 1540 Met Ser Pro Gly Glu Met Glu Lys His Leu Leu Leu Leu Ser Glu Leu 490 495 500 505 ctg ccg gac tgg ctc agc ctc cac cgc atc cgc acc gac acc tac gtc 1588 Leu Pro Asp Trp Leu Ser Leu His Arg Ile Arg Thr Asp Thr Tyr Val 510 515 520 aag ctg gac aag gcc gcg gac ctg gcc cac atc act gca cgc ctg gcc 1636 Lys Leu Asp Lys Ala Ala Asp Leu Ala His Ile Thr Ala Arg Leu Ala 525 530 535 cac cag aca cgt gct gag gag ggg ctg tga gcctgggggc cactgtggac 1686 His Gln Thr Arg Ala Glu Glu Gly Leu 540 545 agacgtgggc ttcagaagct cgctggcctg ggcccaccag cattttcttt tatgaacatg 1746 atacactttg gtcttccttt ccccagcgcc cctgagggcc agaggcagat gtgggctgca 1806 ggctgcacag cccgagggtc tctggctgcg ggcggtgggc cccttcatgg ggctcacctg 1866 gtggattcac attaaaccgg tttctgtggg ca 1898 8 546 PRT Homo sapiens 8 Met Glu Gln Arg Arg Val Thr Asp Phe Phe Ala Arg Arg Arg Pro Gly 1 5 10 15 Pro Pro Arg Ile Ala Pro Pro Lys Leu Ala Cys Arg Thr Pro Ser Pro 20 25 30 Ala Arg Pro Ala Leu Arg Ala Pro Ala Ser Ala Thr Ser Gly Ser Arg 35 40 45 Lys Arg Ala Arg Pro Pro Ala Ala Pro Gly Arg Asp Gln Ala Arg Pro 50 55 60 Pro Ala Arg Arg Arg Leu Arg Leu Ser Val Asp Glu Val Ser Ser Pro 65 70 75 80 Ser Thr Pro Glu Ala Pro Asp Ile Pro Ala Cys Pro Ser Pro Gly Gln 85 90 95 Lys Ile Lys Lys Ser Thr Pro Ala Ala Gly Gln Pro Pro His Leu Thr 100 105 110 Ser Ala Gln Asp Gln Asp Thr Ile Ser Glu Leu Ala Ser Cys Leu Gln 115 120 125 Arg Ala Arg Glu Leu Gly Ala Arg Val Arg Ala Leu Lys Ala Ser Ala 130 135 140 Gln Asp Ala Gly Glu Ser Cys Thr Pro Glu Ala Glu Gly Arg Pro Glu 145 150 155 160 Glu Pro Cys Gly Glu Lys Ala Pro Ala Tyr Gln Arg Phe His Ala Leu 165 170 175 Ala Gln Pro Gly Leu Pro Gly Leu Val Leu Pro Tyr Lys Tyr Gln Val 180 185 190 Leu Ala Glu Met Phe Arg Ser Met Asp Thr Ile Val Gly Met Leu His 195 200 205 Asn Arg Ser Glu Thr Pro Thr Phe Ala Lys Val Gln Arg Gly Val Gln 210 215 220 Asp Met Met Arg Arg Arg Phe Glu Glu Arg Asn Val Gly Gln Ile Lys 225 230 235 240 Thr Val Tyr Pro Ala Ser Tyr Arg Phe Arg Gln Glu Arg Ser Val Pro 245 250 255 Thr Phe Lys Asp Gly Ala Arg Arg Ser Asp Tyr Gln Leu Thr Ile Glu 260 265 270 Pro Leu Leu Glu Gln Glu Ala Asp Gly Ala Ala Pro Gln Leu Thr Ala 275 280 285 Ser Arg Leu Leu Gln Arg Arg Gln Ile Phe Ser Gln Lys Leu Val Glu 290 295 300 His Val Lys Glu His His Lys Ala Phe Leu Ala Ser Leu Ser Pro Ala 305 310 315 320 Met Val Val Pro Glu Asp Gln Leu Thr Arg Trp His Pro Arg Phe Asn 325 330 335 Val Asp Glu Val Pro Asp Ile Glu Pro Ala Ala Leu Pro Gln Pro Pro 340 345 350 Ala Thr Glu Lys Leu Thr Thr Ala Gln Glu Val Leu Ala Arg Ala Arg 355 360 365 Asn Leu Ile Ser Pro Arg Met Glu Lys Ala Leu Ser Gln Leu Ala Leu 370 375 380 Arg Ser Ala Ala Pro Ser Ser Pro Gly Ser Pro Arg Pro Ala Leu Pro 385 390 395 400 Ala Thr Pro Pro Ala Thr Pro Pro Ala Ala Ser Pro Ser Ala Leu Lys 405 410 415 Gly Val Ser Gln Asp Leu Leu Glu Arg Ile Arg Ala Lys Glu Ala Gln 420 425 430 Lys Gln Leu Ala Gln Met Thr Arg Cys Pro Glu Gln Glu Gln Arg Leu 435 440 445 Gln Arg Leu Glu Arg Leu Pro Glu Leu Ala Arg Val Leu Arg Ser Val 450 455 460 Phe Val Ser Glu Arg Lys Pro Ala Leu Ser Met Glu Val Ala Cys Ala 465 470 475 480 Arg Met Val Gly Ser Cys Cys Thr Ile Met Ser Pro Gly Glu Met Glu 485 490 495 Lys His Leu Leu Leu Leu Ser Glu Leu Leu Pro Asp Trp Leu Ser Leu 500 505 510 His Arg Ile Arg Thr Asp Thr Tyr Val Lys Leu Asp Lys Ala Ala Asp 515 520 525 Leu Ala His Ile Thr Ala Arg Leu Ala His Gln Thr Arg Ala Glu Glu 530 535 540 Gly Leu 545 9 1391 DNA Homo sapiens CDS (88)..(1050) 9 cttcaagcgc cggcggacgc agcagtccgg acccaggcgc gcccctcccg ccccagccca 60 ccccggcctg ccgcccggga ggggaac atg ccg cgc tcc ttc ctg gtg aaa acg 114 Met Pro Arg Ser Phe Leu Val Lys Thr 1 5 cac tcc agc cac agg gtc ccc aac tac cgg cgg ctg gag acg cag aga 162 His Ser Ser His Arg Val Pro Asn Tyr Arg Arg Leu Glu Thr Gln Arg 10 15 20 25 gaa atc aat ggt gcc tgc tct gcc tgt ggg ggg ctg gtg gtg ccc ctc 210 Glu Ile Asn Gly Ala Cys Ser Ala Cys Gly Gly Leu Val Val Pro Leu 30 35 40 ctc ccc cga gac aag gag gcc cct tct gtg ccc ggt gac ctt ccc cag 258 Leu Pro Arg Asp Lys Glu Ala Pro Ser Val Pro Gly Asp Leu Pro Gln 45 50 55 ccc tgg gac cgc tcc tcg gcc gtc gcc tgc atc tcc ctg ccc ctc ctg 306 Pro Trp Asp Arg Ser Ser Ala Val Ala Cys Ile Ser Leu Pro Leu Leu 60 65 70 cca cgg atc gag gaa gct ctg ggg gcc tct ggg ctg gac gcc ttg gaa 354 Pro Arg Ile Glu Glu Ala Leu Gly Ala Ser Gly Leu Asp Ala Leu Glu 75 80 85 gtc agc gag gtc gac cct cgg gcc agc cgg gcc gcc att gta ccc ctc 402 Val Ser Glu Val Asp Pro Arg Ala Ser Arg Ala Ala Ile Val Pro Leu 90 95 100 105 aaa gac agc ctg aac cac ctc aac ctg ccc cca ctg ctg gtg ctg ccc 450 Lys Asp Ser Leu Asn His Leu Asn Leu Pro Pro Leu Leu Val Leu Pro 110 115 120 aca cgg tgg tcc ccg acc ttg ggc cca gac cgg cac ggg gct ccg gaa 498 Thr Arg Trp Ser Pro Thr Leu Gly Pro Asp Arg His Gly Ala Pro Glu 125 130 135 aaa ctg ctt ggg gct gag cgg atg ccc cga gcc ccg ggc ggc ttt gag 546 Lys Leu Leu Gly Ala Glu Arg Met Pro Arg Ala Pro Gly Gly Phe Glu 140 145 150 tgc ttc cac tgc cac aaa ccc tac cac acg ctg gcc ggg ctg gcc agg 594 Cys Phe His Cys His Lys Pro Tyr His Thr Leu Ala Gly Leu Ala Arg 155 160 165 cac cgg cag ctg cac tgc cac ctg cag gtg ggg cgt gtc ttc acc tgc 642 His Arg Gln Leu His Cys His Leu Gln Val Gly Arg Val Phe Thr Cys 170 175 180 185 aag tac tgc gac aag gag tac acc agc ctg ggt gcc ctc aag atg cac 690 Lys Tyr Cys Asp Lys Glu Tyr Thr Ser Leu Gly Ala Leu Lys Met His 190 195 200 atc cgc act cac acg ctg ccc tgc acc tgc aag atc tgt ggc aag gcc 738 Ile Arg Thr His Thr Leu Pro Cys Thr Cys Lys Ile Cys Gly Lys Ala 205 210 215 ttc tcc agg ccc tgg tta ctg cag ggc cat gtc cgc acc cac aca ggg 786 Phe Ser Arg Pro Trp Leu Leu Gln Gly His Val Arg Thr His Thr Gly 220 225 230 gag aag ccc tat gcc tgc tcg cac tgc agc agg gcc ttt gcc gac cgc 834 Glu Lys Pro Tyr Ala Cys Ser His Cys Ser Arg Ala Phe Ala Asp Arg 235 240 245 tcc aac ctt cgg gcc cat ctg caa acg cac tca gac gcc aag aag tac 882 Ser Asn Leu Arg Ala His Leu Gln Thr His Ser Asp Ala Lys Lys Tyr 250 255 260 265 cgg tgc cgg cgc tgc acc aag acc ttc tcc cgc atg tcc ctc ctg gcg 930 Arg Cys Arg Arg Cys Thr Lys Thr Phe Ser Arg Met Ser Leu Leu Ala 270 275 280 cgg cat gag gaa gtc aga gcc tcg ggg agg tgg ccg cca gca tgg gcc 978 Arg His Glu Glu Val Arg Ala Ser Gly Arg Trp Pro Pro Ala Trp Ala 285 290 295 ggc act gcc gcc gga tgg ctg gca agg ctg cct agt tcc att gca gca 1026 Gly Thr Ala Ala Gly Trp Leu Ala Arg Leu Pro Ser Ser Ile Ala Ala 300 305 310 gaa atg aac agt tct gac tta tag tgagcaccgc cctgtggccc ttcctcagta 1080 Glu Met Asn Ser Ser Asp Leu 315 320 ggcacaacta cctctcagcc agcccccgcc agcctttggt ttggggtctg ggacgagctg 1140 ccccatgtca cacgtctatg tgcatgtgca cacacactca aacatgtaca cacacgtgcc 1200 ctccccacct cactagactc tccgggagat ggggcaggac tgggagagcc cacgattggt 1260 gatttgggtg tgttgggatg aggcggagtg cctgtgggat ttgtcccggt cagagcctca 1320 gggggctggg gtctcagggc actcagcttc ccaggcaata acagccgtgg ggtaataaat 1380 ggtctctgca c 1391 10 320 PRT Homo sapiens 10 Met Pro Arg Ser Phe Leu Val Lys Thr His Ser Ser His Arg Val Pro 1 5 10 15 Asn Tyr Arg Arg Leu Glu Thr Gln Arg Glu Ile Asn Gly Ala Cys Ser 20 25 30 Ala Cys Gly Gly Leu Val Val Pro Leu Leu Pro Arg Asp Lys Glu Ala 35 40 45 Pro Ser Val Pro Gly Asp Leu Pro Gln Pro Trp Asp Arg Ser Ser Ala 50 55 60 Val Ala Cys Ile Ser Leu Pro Leu Leu Pro Arg Ile Glu Glu Ala Leu 65 70 75 80 Gly Ala Ser Gly Leu Asp Ala Leu Glu Val Ser Glu Val Asp Pro Arg 85 90 95 Ala Ser Arg Ala Ala Ile Val Pro Leu Lys Asp Ser Leu Asn His Leu 100 105 110 Asn Leu Pro Pro Leu Leu Val Leu Pro Thr Arg Trp Ser Pro Thr Leu 115 120 125 Gly Pro Asp Arg His Gly Ala Pro Glu Lys Leu Leu Gly Ala Glu Arg 130 135 140 Met Pro Arg Ala Pro Gly Gly Phe Glu Cys Phe His Cys His Lys Pro 145 150 155 160 Tyr His Thr Leu Ala Gly Leu Ala Arg His Arg Gln Leu His Cys His 165 170 175 Leu Gln Val Gly Arg Val Phe Thr Cys Lys Tyr Cys Asp Lys Glu Tyr 180 185 190 Thr Ser Leu Gly Ala Leu Lys Met His Ile Arg Thr His Thr Leu Pro 195 200 205 Cys Thr Cys Lys Ile Cys Gly Lys Ala Phe Ser Arg Pro Trp Leu Leu 210 215 220 Gln Gly His Val Arg Thr His Thr Gly Glu Lys Pro Tyr Ala Cys Ser 225 230 235 240 His Cys Ser Arg Ala Phe Ala Asp Arg Ser Asn Leu Arg Ala His Leu 245 250 255 Gln Thr His Ser Asp Ala Lys Lys Tyr Arg Cys Arg Arg Cys Thr Lys 260 265 270 Thr Phe Ser Arg Met Ser Leu Leu Ala Arg His Glu Glu Val Arg Ala 275 280 285 Ser Gly Arg Trp Pro Pro Ala Trp Ala Gly Thr Ala Ala Gly Trp Leu 290 295 300 Ala Arg Leu Pro Ser Ser Ile Ala Ala Glu Met Asn Ser Ser Asp Leu 305 310 315 320 11 1854 DNA Homo sapiens CDS (75)..(788) 11 ggcacagggc tgtgactagc gggccggccc gggccaggac agcgggcggc gggcggcgcg 60 ggcctggccc cggg atg gct atg ttc cgc agc ctg gtg gcc tcg gct cag 110 Met Ala Met Phe Arg Ser Leu Val Ala Ser Ala Gln 1 5 10 cag cgg cag ccg ccg gcc ggg ccg gcg ggc ggc gac agc ggc ctg gag 158 Gln Arg Gln Pro Pro Ala Gly Pro Ala Gly Gly Asp Ser Gly Leu Glu 15 20 25 gcg cag tac acc tgc ccc atc tgc ctg gag gtc tat cac cgg ccc gtg 206 Ala Gln Tyr Thr Cys Pro Ile Cys Leu Glu Val Tyr His Arg Pro Val 30 35 40 gcc atc ggc agc tgc ggc cac acg ttc tgc ggg gag tgt ctc cag ccc 254 Ala Ile Gly Ser Cys Gly His Thr Phe Cys Gly Glu Cys Leu Gln Pro 45 50 55 60 tgc ctg cag gtg cca tcc ccg ctg tgc cca ctg tgc cgc ctg ccc ttc 302 Cys Leu Gln Val Pro Ser Pro Leu Cys Pro Leu Cys Arg Leu Pro Phe 65 70 75 gac ccc aag aag gtg gac aag gcc acc cac gtg gag aag cag ctc tca 350 Asp Pro Lys Lys Val Asp Lys Ala Thr His Val Glu Lys Gln Leu Ser 80 85 90 tcc tac aaa gcg ccc tgt cga ggc tgc aac aaa aag gtg acc ctg gca 398 Ser Tyr Lys Ala Pro Cys Arg Gly Cys Asn Lys Lys Val Thr Leu Ala 95 100 105 aag atg aga gtg cac att tcg tcc tgc ctg aag gtc cag gag cag atg 446 Lys Met Arg Val His Ile Ser Ser Cys Leu Lys Val Gln Glu Gln Met 110 115 120 gcc aac tgc ccc aag ttc gtc ccc gtg gtg ccc aca tca cag cct atc 494 Ala Asn Cys Pro Lys Phe Val Pro Val Val Pro Thr Ser Gln Pro Ile 125 130 135 140 ccc agc aac atc ccc aac agg tcc acc ttc gcc tgc ccg tac tgt ggt 542 Pro Ser Asn Ile Pro Asn Arg Ser Thr Phe Ala Cys Pro Tyr Cys Gly 145 150 155 gcc cgc aac ctg gac cag cag gag ctg gtg aag cac tgt gtg gaa agc 590 Ala Arg Asn Leu Asp Gln Gln Glu Leu Val Lys His Cys Val Glu Ser 160 165 170 cac cgc agc gac ccc aac cgc gtg gtg tgc ccc atc tgc tcg gca atg 638 His Arg Ser Asp Pro Asn Arg Val Val Cys Pro Ile Cys Ser Ala Met 175 180 185 ccc tgg ggg gac ccc agc tac aag agc gcc aac ttc ctg cag cac ctg 686 Pro Trp Gly Asp Pro Ser Tyr Lys Ser Ala Asn Phe Leu Gln His Leu 190 195 200 ctt cac cga cac aag ttc tcc tac gac acc ttt gtg gac tac agt att 734 Leu His Arg His Lys Phe Ser Tyr Asp Thr Phe Val Asp Tyr Ser Ile 205 210 215 220 gac gag gag gcc gcc ttc cag gct gct ctg gcc ctg tct ctc tct gag 782 Asp Glu Glu Ala Ala Phe Gln Ala Ala Leu Ala Leu Ser Leu Ser Glu 225 230 235 aac tga agggaagcgc agccacccgc ctgcgtctgg ggtcagggat gtccccgctc 838 Asn ctgtgtcgca cctggcacct gctcgggagc gcacctcacc ggactgagct cacaggagga 898 gcctgcaccc gcgcagaagg ggagccgggg ccgagcctcc gggcctgaat acgggccagc 958 cgccgaggcc gccagagcag ggccgcctgg tcccaccggc gtcgctgggt tcttcggtgc 1018 ttctggccga gcaggcggcc tacttgggca gggctggacg ctgggacctg gagctgccgc 1078 cgtctcttca aagccatgat accccctcgt gggaagaagg gaccgacgcg cgagtcgcgc 1138 tccgcagtcg agccgggagg aacccaggct gctgccctgc ccagcccgac cctgccccgg 1198 ccccgcttcc accttgcgca tttggtactg gcttttgtga tacttaggaa ccctggcatc 1258 ttttctatat tatccagtgt gataatcttt tcacgtttta tagagcaaag acagagcagt 1318 tactcttcat attgcaatat ctgtgtttga ctaggaataa tagtattttt atggaacatt 1378 tacaaaatta tattttttaa gaaaacaatc aaaacaagca ttgggggatt ggggcaagga 1438 tggaaggagc agtggggcag ctgccagagc tcaggcgagc catggggtct gctgtggggt 1498 ctgccctggc cacccactgt gtgtctgggt ccttgaggtt tgtacgtttc tctttgatga 1558 ccaggaagaa atcccagcac cccagccaca ggctgtggct gctcccagca gaggtggggc 1618 cggcagagaa ggggcctcct ccacccagag tcctggcctt ggcccgctgt caccttcaaa 1678 gctgactgtg ccccgctgcg ggaggggacg gcaccccagt ggtggcagag cttgggggcc 1738 tgggcagggg cccgcttggc gggccgggca acacgtcaac attcttttct gttcttggca 1798 ttaattattg ctgtcttttt ttaaaaaaaa aaagtttaaa taaaatgtct cagagc 1854 12 237 PRT Homo sapiens 12 Met Ala Met Phe Arg Ser Leu Val Ala Ser Ala Gln Gln Arg Gln Pro 1 5 10 15 Pro Ala Gly Pro Ala Gly Gly Asp Ser Gly Leu Glu Ala Gln Tyr Thr 20 25 30 Cys Pro Ile Cys Leu Glu Val Tyr His Arg Pro Val Ala Ile Gly Ser 35 40 45 Cys Gly His Thr Phe Cys Gly Glu Cys Leu Gln Pro Cys Leu Gln Val 50 55 60 Pro Ser Pro Leu Cys Pro Leu Cys Arg Leu Pro Phe Asp Pro Lys Lys 65 70 75 80 Val Asp Lys Ala Thr His Val Glu Lys Gln Leu Ser Ser Tyr Lys Ala 85 90 95 Pro Cys Arg Gly Cys Asn Lys Lys Val Thr Leu Ala Lys Met Arg Val 100 105 110 His Ile Ser Ser Cys Leu Lys Val Gln Glu Gln Met Ala Asn Cys Pro 115 120 125 Lys Phe Val Pro Val Val Pro Thr Ser Gln Pro Ile Pro Ser Asn Ile 130 135 140 Pro Asn Arg Ser Thr Phe Ala Cys Pro Tyr Cys Gly Ala Arg Asn Leu 145 150 155 160 Asp Gln Gln Glu Leu Val Lys His Cys Val Glu Ser His Arg Ser Asp 165 170 175 Pro Asn Arg Val Val Cys Pro Ile Cys Ser Ala Met Pro Trp Gly Asp 180 185 190 Pro Ser Tyr Lys Ser Ala Asn Phe Leu Gln His Leu Leu His Arg His 195 200 205 Lys Phe Ser Tyr Asp Thr Phe Val Asp Tyr Ser Ile Asp Glu Glu Ala 210 215 220 Ala Phe Gln Ala Ala Leu Ala Leu Ser Leu Ser Glu Asn 225 230 235 13 5360 DNA Homo sapiens CDS (1601)..(3919) 13 cttccagtct ctgctgatgt gatttcagat gggcgcggct ccagaccatc ccctgcaatg 60 gccagttacg cagcctctcc gagccactgc ctctctgtgg aaggagggcc tgaggctgac 120 ggggagcagc cgcctcgctt ggccactctg ggacctgggg tgatggaggg tgcagcggag 180 actgaccagg aggctctgtg tgcaggggag actggggccc agaagccacc tggagatcgg 240 atgctgtgtc cagggaggat ggatggtgca gctctggggg aacagccaac tgggcagaag 300 ggagcctcgg caagggggtt ctggggacca agagagacca aggcgttggg tgtgtgcaaa 360 gagtctggga gcgagcctgc ggaggacagc agcagggccc acagccgatc agaggaaggt 420 gtctgggagg agaacacgcc ccccttgggc cccctgggtt ttcccgagac ttccagctct 480 ccggcggaca gcaccaccag cagctgcctc cagggcctcc cggacaaccc agacacccag 540 ggtggagtcc aggggcctga aggccccact cctgatgcct ctggctccag tgccaaggat 600 cctccaagct tgtttgatga tgaggtctct ttctcccagc tcttccctcc aggcggtcgc 660 ttgactagaa agaggaaccc gcgtgtctac gggaagcgct gtgagaagcc ggtgctcccg 720 ctgccaaccc agcccagctt tgaggagggc ggtgacccca cgctgggccc agcccgcctg 780 cccacggacc tcagcgactc cagctccctc tgcctctgcc atgaggaccc gtgggaggac 840 gaggatcccg caggtctgcc cgagtccttc ctcctggatg ggttcctcaa tagcagggtg 900 cctggcattg acccctgggc ccccggcctc agcctgtggg ccctggagcc cagcagggaa 960 gctggtgcag agaagctgcc ctcccactgc cccgaggacg atcggccgga ggccattcct 1020 gagctgcaca tggtcccagc ggcttggcga ggcctggaga tgccggcccc tgccgatgac 1080 tcctcctctt ctctcggaga tgtgagcccc gagcccccca gcctggagag agaacgctgt 1140 gacggtgggc ttcccgggaa cacccacctg ctgccgctcc gtgccacgga ctttgaggtg 1200 ctcagcacca agtttgagat gcaagacctg tgctttctgg gaccctttga agaccccgtg 1260 ggtctccccg gccccagctt cttagacttc gagggcacgg cgagctcaca ggggccacag 1320 agccgaagga cagaggaggc tgcaggggca gggagggccc aaggcagagg ccggccggcc 1380 aagggcaggc gggcctccta caagtgcaaa gtgtgcttcc agcgcttccg cagcctgggc 1440 gagctggacc tgcacaagct ggcccacacg cccgcgccgc cgcccacctg ctacatgtgc 1500 gtggagcgca ggtttggctc gcgggagctg ctgcgggggc acctgcagga gaggcacgcg 1560 cagagcaagg ccgggccctg ggcgtgcggc atgtgcctga agg agg tgg ccg acg 1615 Arg Arg Trp Pro Thr 1 5 tct gga tgt aca acg agc acc tgc gtg agc acg cgg tcc gct tcg ccc 1663 Ser Gly Cys Thr Thr Ser Thr Cys Val Ser Thr Arg Ser Ala Ser Pro 10 15 20 gca ggg ggg cag gcg cgg agg tcc ttg ggg gac ctg ccc gga ggc ctg 1711 Ala Gly Gly Gln Ala Arg Arg Ser Leu Gly Asp Leu Pro Gly Gly Leu 25 30 35 gag ggc agc agc gct gtc gcc cac ctt ctg aac agc atc acg gaa ccc 1759 Glu Gly Ser Ser Ala Val Ala His Leu Leu Asn Ser Ile Thr Glu Pro 40 45 50 gcg ccc aaa cac cac agg ggc aag cgc tcc gcc ggc aag gcc gcc ggg 1807 Ala Pro Lys His His Arg Gly Lys Arg Ser Ala Gly Lys Ala Ala Gly 55 60 65 agc ccg gga gac ccg tgg ggg caa gag gga gaa gcc aag aaa gac agc 1855 Ser Pro Gly Asp Pro Trp Gly Gln Glu Gly Glu Ala Lys Lys Asp Ser 70 75 80 85 ccg ggc gag agg gcg aaa ccc cgg gca cgc agc acc ccc agc aac cca 1903 Pro Gly Glu Arg Ala Lys Pro Arg Ala Arg Ser Thr Pro Ser Asn Pro 90 95 100 gac ggg gcc gcg acc cca gac agc gcc tct gcc acc gcc ctg gct gac 1951 Asp Gly Ala Ala Thr Pro Asp Ser Ala Ser Ala Thr Ala Leu Ala Asp 105 110 115 gcc ggc agc ccg ggc ccc ccc agg acg acc ccc agc ccg tcc ccc gac 1999 Ala Gly Ser Pro Gly Pro Pro Arg Thr Thr Pro Ser Pro Ser Pro Asp 120 125 130 ccc tgg gcc ggc ggg gag ccc ctc ctg caa gcc acc ccg gtg cac gag 2047 Pro Trp Ala Gly Gly Glu Pro Leu Leu Gln Ala Thr Pro Val His Glu 135 140 145 gcc tgc aag gac ccc tcc cgc gac tgc cac cac tgc ggg aag cgc ttc 2095 Ala Cys Lys Asp Pro Ser Arg Asp Cys His His Cys Gly Lys Arg Phe 150 155 160 165 ccc aag ccc ttc aag ctg cag cgc cac ctg gcg gtg cac agc ccg cag 2143 Pro Lys Pro Phe Lys Leu Gln Arg His Leu Ala Val His Ser Pro Gln 170 175 180 cgc gtc tac ctg tgc ccc cgg tgc ccc cgg gtc tac ccc gag cac ggg 2191 Arg Val Tyr Leu Cys Pro Arg Cys Pro Arg Val Tyr Pro Glu His Gly 185 190 195 gag ctg ctg gca cac ctg ggc ggg gcg cac ggg ctg ctg gag cgg ccg 2239 Glu Leu Leu Ala His Leu Gly Gly Ala His Gly Leu Leu Glu Arg Pro 200 205 210 gag ctg cag cac acg ccg ctg tat gcc tgc gag ctc tgc gcc acg gtt 2287 Glu Leu Gln His Thr Pro Leu Tyr Ala Cys Glu Leu Cys Ala Thr Val 215 220 225 atg cgc atc atc aag aag tcc ttc gcc tgc agc tcc tgc aac tac acc 2335 Met Arg Ile Ile Lys Lys Ser Phe Ala Cys Ser Ser Cys Asn Tyr Thr 230 235 240 245 ttc gcc aag aag gag cag ttc gac cgc cac atg aac aag cac ctc agg 2383 Phe Ala Lys Lys Glu Gln Phe Asp Arg His Met Asn Lys His Leu Arg 250 255 260 ggg ggg cgg cag ccc ttc gcg ttc cgc ggc gtg cgg agg ccg gga gcg 2431 Gly Gly Arg Gln Pro Phe Ala Phe Arg Gly Val Arg Arg Pro Gly Ala 265 270 275 ccg gga cag aag gcc cgg gcc ctc gag ggc aca ctg ccc agc aaa cgg 2479 Pro Gly Gln Lys Ala Arg Ala Leu Glu Gly Thr Leu Pro Ser Lys Arg 280 285 290 cgc agg gtg gcc atg ccc ggc agt gcc cct ggg ccc ggc gag gac agg 2527 Arg Arg Val Ala Met Pro Gly Ser Ala Pro Gly Pro Gly Glu Asp Arg 295 300 305 cct cct ccc cgg gga agc agc ccc atc ctg agt gag ggc tct ctc ccg 2575 Pro Pro Pro Arg Gly Ser Ser Pro Ile Leu Ser Glu Gly Ser Leu Pro 310 315 320 325 gcc ctg ctc cac ctg tgt tcg gag gtg gct ccc agc acc acc aag gga 2623 Ala Leu Leu His Leu Cys Ser Glu Val Ala Pro Ser Thr Thr Lys Gly 330 335 340 tgg ccc gag acc cta gag agg cct gta gac ccc gtg acc cac ccg atc 2671 Trp Pro Glu Thr Leu Glu Arg Pro Val Asp Pro Val Thr His Pro Ile 345 350 355 aga ggt tgt gag ctg cca tcc aac cac cag gag tgt ccc ccg ccg tct 2719 Arg Gly Cys Glu Leu Pro Ser Asn His Gln Glu Cys Pro Pro Pro Ser 360 365 370 ctg tct ccc ttc cca gct gcc ttg gct gat ggc aga gga gac tgc gcg 2767 Leu Ser Pro Phe Pro Ala Ala Leu Ala Asp Gly Arg Gly Asp Cys Ala 375 380 385 ctg gac gga gcc ctg gag agg cca gag aac gag gct tcc cca ggc agc 2815 Leu Asp Gly Ala Leu Glu Arg Pro Glu Asn Glu Ala Ser Pro Gly Ser 390 395 400 405 ccc ggg cct ctt ctc cag caa gct ctc cct ctg ggg gca tct ctg ccg 2863 Pro Gly Pro Leu Leu Gln Gln Ala Leu Pro Leu Gly Ala Ser Leu Pro 410 415 420 cgg ccg gga gcc aga ggc caa gat gcg gag gga aag agg gct cct ctc 2911 Arg Pro Gly Ala Arg Gly Gln Asp Ala Glu Gly Lys Arg Ala Pro Leu 425 430 435 gtg ttc tca ggg aaa cgc agg gcc ccg ggt gcc cgt ggc agg tgt gcc 2959 Val Phe Ser Gly Lys Arg Arg Ala Pro Gly Ala Arg Gly Arg Cys Ala 440 445 450 cct gac cat ttc cag gaa gac cac cta ctt cag aaa gag aag gag gtg 3007 Pro Asp His Phe Gln Glu Asp His Leu Leu Gln Lys Glu Lys Glu Val 455 460 465 tcc tca agc cac atg gtg tct gag ggg ggg ccc cga ggc gcc ttc cac 3055 Ser Ser Ser His Met Val Ser Glu Gly Gly Pro Arg Gly Ala Phe His 470 475 480 485 aag ggc agc gcc acc aag cct gcg ggc tgc cag agc tca tca aag gac 3103 Lys Gly Ser Ala Thr Lys Pro Ala Gly Cys Gln Ser Ser Ser Lys Asp 490 495 500 agg tcg gca gca tcc acc ccc agc aaa gca ctc aag ttc cca gtg cac 3151 Arg Ser Ala Ala Ser Thr Pro Ser Lys Ala Leu Lys Phe Pro Val His 505 510 515 cca agg aag gcg gtg ggg agc ctg gca ccc ggg gag ctg gcc cgt ggc 3199 Pro Arg Lys Ala Val Gly Ser Leu Ala Pro Gly Glu Leu Ala Arg Gly 520 525 530 aca gag aat ggg atg aag ccc gcc acc ccc aaa gcc aaa ccc ggc ccc 3247 Thr Glu Asn Gly Met Lys Pro Ala Thr Pro Lys Ala Lys Pro Gly Pro 535 540 545 agc tcc cag ggc agt gga agc cct cgc ccc ggc acc aag aca gga ggt 3295 Ser Ser Gln Gly Ser Gly Ser Pro Arg Pro Gly Thr Lys Thr Gly Gly 550 555 560 565 ggc agc cag ccc cag cca gcc agc ggg cag ctc cag agc gag aca gcc 3343 Gly Ser Gln Pro Gln Pro Ala Ser Gly Gln Leu Gln Ser Glu Thr Ala 570 575 580 acc acc cca gcc aag ccc agc ttc ccc agc cgg agc cct gca cca gag 3391 Thr Thr Pro Ala Lys Pro Ser Phe Pro Ser Arg Ser Pro Ala Pro Glu 585 590 595 agg ctc ccc gct cga gcc caa gcc aag agc tgc acc aag ggg cca agg 3439 Arg Leu Pro Ala Arg Ala Gln Ala Lys Ser Cys Thr Lys Gly Pro Arg 600 605 610 gaa gct ggt gag cag ggg ccc cac ggg agc cta ggt ccc aag gag aag 3487 Glu Ala Gly Glu Gln Gly Pro His Gly Ser Leu Gly Pro Lys Glu Lys 615 620 625 gga gag agc agt acg aag agg aaa aag ggc cag gtc cca ggg cca gcc 3535 Gly Glu Ser Ser Thr Lys Arg Lys Lys Gly Gln Val Pro Gly Pro Ala 630 635 640 645 agg agt gaa agt gtg ggg agc ttc ggg aga gcc ccc tca gcc cct gac 3583 Arg Ser Glu Ser Val Gly Ser Phe Gly Arg Ala Pro Ser Ala Pro Asp 650 655 660 aag ccc ccc cgg acc cct cgg aag cag gca act ccc agc cgc gtg ctc 3631 Lys Pro Pro Arg Thr Pro Arg Lys Gln Ala Thr Pro Ser Arg Val Leu 665 670 675 ccg acc aag ccc aag ccc aac agc cag aac aaa ccc agg ccg cca cca 3679 Pro Thr Lys Pro Lys Pro Asn Ser Gln Asn Lys Pro Arg Pro Pro Pro 680 685 690 tca gag cag cgg aag gca gag ccg ggc cac aca cag agg aag gac aga 3727 Ser Glu Gln Arg Lys Ala Glu Pro Gly His Thr Gln Arg Lys Asp Arg 695 700 705 ctg ggc aag gcc ttc ccc cag ggg aga ccc ctg ctc agg ccc ccc aag 3775 Leu Gly Lys Ala Phe Pro Gln Gly Arg Pro Leu Leu Arg Pro Pro Lys 710 715 720 725 agg ggc aca gct gtc cac ggt gct gaa cct gcc gag ccg cac acc cac 3823 Arg Gly Thr Ala Val His Gly Ala Glu Pro Ala Glu Pro His Thr His 730 735 740 cgg acg gcc gag gcc cag agt gac ctc ctc agc cag ctc ttc ggg cag 3871 Arg Thr Ala Glu Ala Gln Ser Asp Leu Leu Ser Gln Leu Phe Gly Gln 745 750 755 aga cta act ggc ttc aaa atc cct tta aag aaa gat gct tct gag taa 3919 Arg Leu Thr Gly Phe Lys Ile Pro Leu Lys Lys Asp Ala Ser Glu 760 765 770 tttctagaag caagagcctg ggaccggagc tgggcgttcc tgtctcggcc tgcctccttg 3979 gccagctccg gctccctgag atggtccact ctgtggccac ttgacttctt gtgcaactgc 4039 tcaggccttg atgtcagagc tgaggtggtg atgctttgaa cagggcccag gtgggcagca 4099 ttccctttct tgctggaagg ctgggggtga aagacggggc cactgcagcc cttttgagac 4159 cacacagctg ttttcttggt accaagtact tgaagagaca gcagcccatc ccctcagccc 4219 acacccctgc gccctgtggg caccgacacc acagaagcca atgtttggag atttgcacaa 4279 cctcccgttc ccccacatgg agaagggaag taagttgagg cagccgtggg atggtggtag 4339 gttccctctt agtcttgctg ctgttgctgg aattccaaag tgaccttaga aaccacgtgg 4399 gggaaggcag tgctcactac ttagaagggt tgcttctgag ccgcctggtc ccccaagagc 4459 acaacaggcc tcctccctct gaccacaggg tcatgcctcc tccctctgac cacagggtca 4519 tgcctcctcc ctctgaccac agggtcatgc cagcctccat ttgcgctgcg ggagaaaagc 4579 ccatctctag cacaccttga ccccaggaac cgggttcccg tatggaactg ggaagaaacc 4639 gcccctgtgc cagctcccgc gggccctcct cgttccctcc cagcctccat ggccgccctc 4699 tagagcctcc ctgctgtacg gagctctggg ctccacctat ttgcaatgtt actctgaagt 4759 ttctggtgct atttttgtgt tgtaatgtga atacaggctt ccttgatttt ttttttaatg 4819 ggggtattgg gtgggacaga cggggtcagg gaggccccac catggcttgt cgagggcacg 4879 ggcacctgca tggcggcgct ctccctgcct cccctgccgg gctgcaagcc tgaggtctgt 4939 gctgccggac ggggatgctc agggctgggg ctgcagagcc gctgccctgg ccagggcacc 4999 ctcatgcacc gacccaaccc aggcctggga cgcacgtgtc ctctcacagc gtcgtgcctg 5059 tgaaggtggg tcaaagggtg agagggcttc cttctcaccc ttctctccat aagtatcttg 5119 aagatccatg gtttgttttg ctctattgtt tagtttttac ttgggtgcaa tgtgtacgtc 5179 aaaagttttt attttgatat ttgaaagaga ccaaatcagg cccagaccgc ctctctggaa 5239 ggtgttgtag gccattcaaa acgcctccgg agtgtcgcaa accaagtgcg gaggggccct 5299 gaggttgtac tgtaaacatc atagtgactt gtcttttcaa atatattccc actattttcg 5359 c 5360 14 772 PRT Homo sapiens 14 Arg Arg Trp Pro Thr Ser Gly Cys Thr Thr Ser Thr Cys Val Ser Thr 1 5 10 15 Arg Ser Ala Ser Pro Ala Gly Gly Gln Ala Arg Arg Ser Leu Gly Asp 20 25 30 Leu Pro Gly Gly Leu Glu Gly Ser Ser Ala Val Ala His Leu Leu Asn 35 40 45 Ser Ile Thr Glu Pro Ala Pro Lys His His Arg Gly Lys Arg Ser Ala 50 55 60 Gly Lys Ala Ala Gly Ser Pro Gly Asp Pro Trp Gly Gln Glu Gly Glu 65 70 75 80 Ala Lys Lys Asp Ser Pro Gly Glu Arg Ala Lys Pro Arg Ala Arg Ser 85 90 95 Thr Pro Ser Asn Pro Asp Gly Ala Ala Thr Pro Asp Ser Ala Ser Ala 100 105 110 Thr Ala Leu Ala Asp Ala Gly Ser Pro Gly Pro Pro Arg Thr Thr Pro 115 120 125 Ser Pro Ser Pro Asp Pro Trp Ala Gly Gly Glu Pro Leu Leu Gln Ala 130 135 140 Thr Pro Val His Glu Ala Cys Lys Asp Pro Ser Arg Asp Cys His His 145 150 155 160 Cys Gly Lys Arg Phe Pro Lys Pro Phe Lys Leu Gln Arg His Leu Ala 165 170 175 Val His Ser Pro Gln Arg Val Tyr Leu Cys Pro Arg Cys Pro Arg Val 180 185 190 Tyr Pro Glu His Gly Glu Leu Leu Ala His Leu Gly Gly Ala His Gly 195 200 205 Leu Leu Glu Arg Pro Glu Leu Gln His Thr Pro Leu Tyr Ala Cys Glu 210 215 220 Leu Cys Ala Thr Val Met Arg Ile Ile Lys Lys Ser Phe Ala Cys Ser 225 230 235 240 Ser Cys Asn Tyr Thr Phe Ala Lys Lys Glu Gln Phe Asp Arg His Met 245 250 255 Asn Lys His Leu Arg Gly Gly Arg Gln Pro Phe Ala Phe Arg Gly Val 260 265 270 Arg Arg Pro Gly Ala Pro Gly Gln Lys Ala Arg Ala Leu Glu Gly Thr 275 280 285 Leu Pro Ser Lys Arg Arg Arg Val Ala Met Pro Gly Ser Ala Pro Gly 290 295 300 Pro Gly Glu Asp Arg Pro Pro Pro Arg Gly Ser Ser Pro Ile Leu Ser 305 310 315 320 Glu Gly Ser Leu Pro Ala Leu Leu His Leu Cys Ser Glu Val Ala Pro 325 330 335 Ser Thr Thr Lys Gly Trp Pro Glu Thr Leu Glu Arg Pro Val Asp Pro 340 345 350 Val Thr His Pro Ile Arg Gly Cys Glu Leu Pro Ser Asn His Gln Glu 355 360 365 Cys Pro Pro Pro Ser Leu Ser Pro Phe Pro Ala Ala Leu Ala Asp Gly 370 375 380 Arg Gly Asp Cys Ala Leu Asp Gly Ala Leu Glu Arg Pro Glu Asn Glu 385 390 395 400 Ala Ser Pro Gly Ser Pro Gly Pro Leu Leu Gln Gln Ala Leu Pro Leu 405 410 415 Gly Ala Ser Leu Pro Arg Pro Gly Ala Arg Gly Gln Asp Ala Glu Gly 420 425 430 Lys Arg Ala Pro Leu Val Phe Ser Gly Lys Arg Arg Ala Pro Gly Ala 435 440 445 Arg Gly Arg Cys Ala Pro Asp His Phe Gln Glu Asp His Leu Leu Gln 450 455 460 Lys Glu Lys Glu Val Ser Ser Ser His Met Val Ser Glu Gly Gly Pro 465 470 475 480 Arg Gly Ala Phe His Lys Gly Ser Ala Thr Lys Pro Ala Gly Cys Gln 485 490 495 Ser Ser Ser Lys Asp Arg Ser Ala Ala Ser Thr Pro Ser Lys Ala Leu 500 505 510 Lys Phe Pro Val His Pro Arg Lys Ala Val Gly Ser Leu Ala Pro Gly 515 520 525 Glu Leu Ala Arg Gly Thr Glu Asn Gly Met Lys Pro Ala Thr Pro Lys 530 535 540 Ala Lys Pro Gly Pro Ser Ser Gln Gly Ser Gly Ser Pro Arg Pro Gly 545 550 555 560 Thr Lys Thr Gly Gly Gly Ser Gln Pro Gln Pro Ala Ser Gly Gln Leu 565 570 575 Gln Ser Glu Thr Ala Thr Thr Pro Ala Lys Pro Ser Phe Pro Ser Arg 580 585 590 Ser Pro Ala Pro Glu Arg Leu Pro Ala Arg Ala Gln Ala Lys Ser Cys 595 600 605 Thr Lys Gly Pro Arg Glu Ala Gly Glu Gln Gly Pro His Gly Ser Leu 610 615 620 Gly Pro Lys Glu Lys Gly Glu Ser Ser Thr Lys Arg Lys Lys Gly Gln 625 630 635 640 Val Pro Gly Pro Ala Arg Ser Glu Ser Val Gly Ser Phe Gly Arg Ala 645 650 655 Pro Ser Ala Pro Asp Lys Pro Pro Arg Thr Pro Arg Lys Gln Ala Thr 660 665 670 Pro Ser Arg Val Leu Pro Thr Lys Pro Lys Pro Asn Ser Gln Asn Lys 675 680 685 Pro Arg Pro Pro Pro Ser Glu Gln Arg Lys Ala Glu Pro Gly His Thr 690 695 700 Gln Arg Lys Asp Arg Leu Gly Lys Ala Phe Pro Gln Gly Arg Pro Leu 705 710 715 720 Leu Arg Pro Pro Lys Arg Gly Thr Ala Val His Gly Ala Glu Pro Ala 725 730 735 Glu Pro His Thr His Arg Thr Ala Glu Ala Gln Ser Asp Leu Leu Ser 740 745 750 Gln Leu Phe Gly Gln Arg Leu Thr Gly Phe Lys Ile Pro Leu Lys Lys 755 760 765 Asp Ala Ser Glu 770 15 2914 DNA Homo sapiens CDS (333)..(1652) 15 acagctgatc cggttcttgc ggcggtgcgt gctggcgccg gagagttccg cgcgtgtcct 60 cgggctgtcc gcggggaagc tggtccggga gtgggcgccc gcccgcctgc ctcgcgcctt 120 gcgcccccgg caccgcttcc agaagggaaa gtcgccgccg ctcgcggtag acccgggcag 180 cgcttccatt ccgcggagct ggaaatagga atccagccat ccgtgggtca ggaggaatgg 240 agactgtacc ttccacatag attcacaagc tgccctgcag tggccttggc ttcaggaaag 300 gagcattcag acgcttccgt cagcctccca gg atg gca gcc cct gac ctg gcc 353 Met Ala Ala Pro Asp Leu Ala 1 5 cac gga ggt cat gtt tct agg gac tca gtc tgc ctt cat gaa gaa cag 401 His Gly Gly His Val Ser Arg Asp Ser Val Cys Leu His Glu Glu Gln 10 15 20 aca cag gca gca ggg atg gtg gct ggc tgg ctg ata aat tgt tac cag 449 Thr Gln Ala Ala Gly Met Val Ala Gly Trp Leu Ile Asn Cys Tyr Gln 25 30 35 gac gcg gtg acc ttt gac gac gtg gct gtg gac ttc acc cag gag gaa 497 Asp Ala Val Thr Phe Asp Asp Val Ala Val Asp Phe Thr Gln Glu Glu 40 45 50 55 tgg act tta ctg gac cca tct cag aga gac ctc tac aga gat gtg atg 545 Trp Thr Leu Leu Asp Pro Ser Gln Arg Asp Leu Tyr Arg Asp Val Met 60 65 70 ctg gaa aac tac gag aac ctg gcc tca gta gaa tgg cga ctt aaa acc 593 Leu Glu Asn Tyr Glu Asn Leu Ala Ser Val Glu Trp Arg Leu Lys Thr 75 80 85 aaa ggg cca gca ctt cgg cag gat aga tct tgg ttc aga gca tca aat 641 Lys Gly Pro Ala Leu Arg Gln Asp Arg Ser Trp Phe Arg Ala Ser Asn 90 95 100 gag aca cag acg gca aga agc cac aat gga ggg cag ctc tgt gac cgc 689 Glu Thr Gln Thr Ala Arg Ser His Asn Gly Gly Gln Leu Cys Asp Arg 105 110 115 acg cag tgt gga gaa gct ttc agt gaa cac tca ggc ctc agc aca cac 737 Thr Gln Cys Gly Glu Ala Phe Ser Glu His Ser Gly Leu Ser Thr His 120 125 130 135 gtg aga act caa aat aca gga gac agt tgt gtg tct aat cat tat gaa 785 Val Arg Thr Gln Asn Thr Gly Asp Ser Cys Val Ser Asn His Tyr Glu 140 145 150 agg gac ttt ttt att cca tgc cag aaa acc ttg ttc aca att gga gag 833 Arg Asp Phe Phe Ile Pro Cys Gln Lys Thr Leu Phe Thr Ile Gly Glu 155 160 165 cag ttt tcc gtg ttg ggt cag tgt gga aaa gcc ttc agc tct act cca 881 Gln Phe Ser Val Leu Gly Gln Cys Gly Lys Ala Phe Ser Ser Thr Pro 170 175 180 aat gtt gtt tcc cag caa gca tgc act cgg gac aga tct ctt gac tac 929 Asn Val Val Ser Gln Gln Ala Cys Thr Arg Asp Arg Ser Leu Asp Tyr 185 190 195 agc agc tgt ggg gaa gtg ttt ctt aat cag tca tac ctt cag gca cgt 977 Ser Ser Cys Gly Glu Val Phe Leu Asn Gln Ser Tyr Leu Gln Ala Arg 200 205 210 215 gcg gga agt cac aac gga gaa gaa aca tgg aaa tgg aag ccg tgt ggg 1025 Ala Gly Ser His Asn Gly Glu Glu Thr Trp Lys Trp Lys Pro Cys Gly 220 225 230 aaa gct cta act cac tcc atg ggc tgc gcc aca cct gtt gaa atg cat 1073 Lys Ala Leu Thr His Ser Met Gly Cys Ala Thr Pro Val Glu Met His 235 240 245 gcc gtc agg aat ccc cac gta tgt agg gaa tgt ggg aag gcc ttt agg 1121 Ala Val Arg Asn Pro His Val Cys Arg Glu Cys Gly Lys Ala Phe Arg 250 255 260 tac act gcc tac ctt act ggt cgc gtg caa gtc cac cct ggg gag aag 1169 Tyr Thr Ala Tyr Leu Thr Gly Arg Val Gln Val His Pro Gly Glu Lys 265 270 275 ccc tgt gaa ttg gaa gaa tgt gga aaa gcc tcc cct gtt tct tcc agc 1217 Pro Cys Glu Leu Glu Glu Cys Gly Lys Ala Ser Pro Val Ser Ser Ser 280 285 290 295 cta act caa cat gta aga att cat gct gca gag aaa ccc tgt gaa tgt 1265 Leu Thr Gln His Val Arg Ile His Ala Ala Glu Lys Pro Cys Glu Cys 300 305 310 aaa gaa tgc gga aaa gcc ttc act gga ctc tca ggt ctt tct aaa cac 1313 Lys Glu Cys Gly Lys Ala Phe Thr Gly Leu Ser Gly Leu Ser Lys His 315 320 325 gtc caa aca gac cct gga cag aag ccc tat gaa tgt aag gac tgt ggg 1361 Val Gln Thr Asp Pro Gly Gln Lys Pro Tyr Glu Cys Lys Asp Cys Gly 330 335 340 aaa gcc tgc ggt ggg ttt tat cta ctg aat gag cat gga aaa act cac 1409 Lys Ala Cys Gly Gly Phe Tyr Leu Leu Asn Glu His Gly Lys Thr His 345 350 355 acg agg gag aag ccc ttt gca tgt gtg gtt tgc gga aaa tat ttt aga 1457 Thr Arg Glu Lys Pro Phe Ala Cys Val Val Cys Gly Lys Tyr Phe Arg 360 365 370 375 aat tcc tca tgc ctt aat aat cat gtt cga att cac act gga ata aaa 1505 Asn Ser Ser Cys Leu Asn Asn His Val Arg Ile His Thr Gly Ile Lys 380 385 390 ccc ata cat gca gct act gtg gga agg cct tca ctg tgc gct gtg gcc 1553 Pro Ile His Ala Ala Thr Val Gly Arg Pro Ser Leu Cys Ala Val Ala 395 400 405 tta cta gac acg tac gaa cac aca cgg gcg aga agc cat aca cgt gta 1601 Leu Leu Asp Thr Tyr Glu His Thr Arg Ala Arg Ser His Thr Arg Val 410 415 420 agg act gcg gga aag cct tct gta cat cct cgg gcc tta ctg agc atg 1649 Arg Thr Ala Gly Lys Pro Ser Val His Pro Arg Ala Leu Leu Ser Met 425 430 435 taa ggactcacac tggagagaaa ccatatgaat gtaaagattg tgggaaatcc 1702 ttcactgttt cttcaagcct gactgagcac gcaagaatcc ataccggaga gaaaccctac 1762 gaatgtaagc agtgtggcaa agccttcaca gggcgctcag gcctcactaa acacatgcgg 1822 acacacaccg gggagaagcc ctatgaatgt aaggactgtg ggaaagccta caatagggtt 1882 tatctactga atgagcatgt gaaaactcac acagaggaga agccctttac atgtacggta 1942 tgcaggaaat ccttcagaaa ttcctcgtgc ctgaataagc acattcatat tcacactgga 2002 ataaaacctt atgaatgtaa ggactgtggg aaaacattca ctgtttcttc gagcctaact 2062 gagcacatac gaactcacac tggagagaaa ccttatgaat gtaaagtatg cggaaaggcc 2122 ttcaccacat cctcacacct tatcgtgcac ataagaaccc acaccggtga gaaaccctac 2182 atatgtaagg agtgtgggaa agcctttgct tcctcctcac accttatcga acacagaagg 2242 actcatacag gagagaaacc ttacatatgt aacgagtgtg ggaaagcctt ccgtgcctcc 2302 tctcacctgc ataaacatgg aagaattcac actgggcaga aaccctataa atgtaaggaa 2362 tgtgggaaag catacaatag gttttatcta ctaaaagaac atttaaaaac ttacactgaa 2422 gagcaggttt ttgtatgtaa ggactgtgga aaatctttta agaattcctc atgccttaac 2482 catcacactc aaattcacac tgatgagaaa cctttctaat gtaaagaatg tggggaagct 2542 gtcagctaca ctcattcacg ttgaagacat gaaagacctc tcgttctcca gatgtccatg 2602 acttgaggaa tgtggctagg caatcagcat ctcatcacaa cccggcaggc aggaactcac 2662 cctggagccc tatgcagcag acacagagaa agccctcagt gttctctaag gtcttgctga 2722 atatggatgg tatccagtag agagaactct attgatgtgt gatatgcagc agcattgttg 2782 ctgttctgct cagtactttg atgatccctt gtgatctcac aatgaagaaa aaccttagga 2842 gggtgagaat tgttgggaag tcatttttta cattacatct ttcctcaccc tttagtaaac 2902 gtggtgattg ac 2914 16 439 PRT Homo sapiens 16 Met Ala Ala Pro Asp Leu Ala His Gly Gly His Val Ser Arg Asp Ser 1 5 10 15 Val Cys Leu His Glu Glu Gln Thr Gln Ala Ala Gly Met Val Ala Gly 20 25 30 Trp Leu Ile Asn Cys Tyr Gln Asp Ala Val Thr Phe Asp Asp Val Ala 35 40 45 Val Asp Phe Thr Gln Glu Glu Trp Thr Leu Leu Asp Pro Ser Gln Arg 50 55 60 Asp Leu Tyr Arg Asp Val Met Leu Glu Asn Tyr Glu Asn Leu Ala Ser 65 70 75 80 Val Glu Trp Arg Leu Lys Thr Lys Gly Pro Ala Leu Arg Gln Asp Arg 85 90 95 Ser Trp Phe Arg Ala Ser Asn Glu Thr Gln Thr Ala Arg Ser His Asn 100 105 110 Gly Gly Gln Leu Cys Asp Arg Thr Gln Cys Gly Glu Ala Phe Ser Glu 115 120 125 His Ser Gly Leu Ser Thr His Val Arg Thr Gln Asn Thr Gly Asp Ser 130 135 140 Cys Val Ser Asn His Tyr Glu Arg Asp Phe Phe Ile Pro Cys Gln Lys 145 150 155 160 Thr Leu Phe Thr Ile Gly Glu Gln Phe Ser Val Leu Gly Gln Cys Gly 165 170 175 Lys Ala Phe Ser Ser Thr Pro Asn Val Val Ser Gln Gln Ala Cys Thr 180 185 190 Arg Asp Arg Ser Leu Asp Tyr Ser Ser Cys Gly Glu Val Phe Leu Asn 195 200 205 Gln Ser Tyr Leu Gln Ala Arg Ala Gly Ser His Asn Gly Glu Glu Thr 210 215 220 Trp Lys Trp Lys Pro Cys Gly Lys Ala Leu Thr His Ser Met Gly Cys 225 230 235 240 Ala Thr Pro Val Glu Met His Ala Val Arg Asn Pro His Val Cys Arg 245 250 255 Glu Cys Gly Lys Ala Phe Arg Tyr Thr Ala Tyr Leu Thr Gly Arg Val 260 265 270 Gln Val His Pro Gly Glu Lys Pro Cys Glu Leu Glu Glu Cys Gly Lys 275 280 285 Ala Ser Pro Val Ser Ser Ser Leu Thr Gln His Val Arg Ile His Ala 290 295 300 Ala Glu Lys Pro Cys Glu Cys Lys Glu Cys Gly Lys Ala Phe Thr Gly 305 310 315 320 Leu Ser Gly Leu Ser Lys His Val Gln Thr Asp Pro Gly Gln Lys Pro 325 330 335 Tyr Glu Cys Lys Asp Cys Gly Lys Ala Cys Gly Gly Phe Tyr Leu Leu 340 345 350 Asn Glu His Gly Lys Thr His Thr Arg Glu Lys Pro Phe Ala Cys Val 355 360 365 Val Cys Gly Lys Tyr Phe Arg Asn Ser Ser Cys Leu Asn Asn His Val 370 375 380 Arg Ile His Thr Gly Ile Lys Pro Ile His Ala Ala Thr Val Gly Arg 385 390 395 400 Pro Ser Leu Cys Ala Val Ala Leu Leu Asp Thr Tyr Glu His Thr Arg 405 410 415 Ala Arg Ser His Thr Arg Val Arg Thr Ala Gly Lys Pro Ser Val His 420 425 430 Pro Arg Ala Leu Leu Ser Met 435 17 2254 DNA Homo sapiens CDS (36)..(626) 17 ctccctctcc gagtgctgct cctgtcattg tggcc atg gac gat acc ctg ttc 53 Met Asp Asp Thr Leu Phe 1 5 cag ttg aag ttc acg gcg aag cag ctg gag aag ctg gcc aag aag gcg 101 Gln Leu Lys Phe Thr Ala Lys Gln Leu Glu Lys Leu Ala Lys Lys Ala 10 15 20 gag aag gac tcc aag gcg gag cag gcc aaa gtg aag aag gcc ctt ctg 149 Glu Lys Asp Ser Lys Ala Glu Gln Ala Lys Val Lys Lys Ala Leu Leu 25 30 35 cag aaa aat gta gag tgt gcc cgt gtg tat gcc gag aac gcc atc cgc 197 Gln Lys Asn Val Glu Cys Ala Arg Val Tyr Ala Glu Asn Ala Ile Arg 40 45 50 aag aag aac gaa ggt gtg aac tgg ctt cgg atg gcg tcc cgc gta gac 245 Lys Lys Asn Glu Gly Val Asn Trp Leu Arg Met Ala Ser Arg Val Asp 55 60 65 70 gca gtg gcc tcc aag gtg cag aca gct gtg act atg aag ggg gtg acc 293 Ala Val Ala Ser Lys Val Gln Thr Ala Val Thr Met Lys Gly Val Thr 75 80 85 aag aat atg gcc cag gtg acc aaa gcc ctg gac aag gcc ctg agc acc 341 Lys Asn Met Ala Gln Val Thr Lys Ala Leu Asp Lys Ala Leu Ser Thr 90 95 100 atg gac ctg cag aag gtc tcc tca gtg atg gac agg ttc gag cag cag 389 Met Asp Leu Gln Lys Val Ser Ser Val Met Asp Arg Phe Glu Gln Gln 105 110 115 gtg cag aac ctg gac gtc cat aca tcg gtg atg gag gac tcc atg agc 437 Val Gln Asn Leu Asp Val His Thr Ser Val Met Glu Asp Ser Met Ser 120 125 130 tcg gcc acc acc ctg acc acg ccg cag gag cag gtg gac agc ctc atc 485 Ser Ala Thr Thr Leu Thr Thr Pro Gln Glu Gln Val Asp Ser Leu Ile 135 140 145 150 atg cag atc gcc gag gag aat ggc ctg gag gtg ctg gac cag ctc agc 533 Met Gln Ile Ala Glu Glu Asn Gly Leu Glu Val Leu Asp Gln Leu Ser 155 160 165 cag ctg ccc gag ggc gcc tct gcc gtg ggc gag agc tct gtg cgc agc 581 Gln Leu Pro Glu Gly Ala Ser Ala Val Gly Glu Ser Ser Val Arg Ser 170 175 180 cag gag gac cag ctg tca cgg agg ttg gcc gcc ttg agg aac tag 626 Gln Glu Asp Gln Leu Ser Arg Arg Leu Ala Ala Leu Arg Asn 185 190 195 ccgtgccccg ccggtgtgca ccgcctctgc cccgtgatgt gctggaaggc tcctgtcctc 686 tccccaccgc gtcttgcctt tgtgctgacc ccgcggggct gcggccggca gccactctgc 746 gtctctcacc tgccaggcct gcgtggcctt agggttgttc ctgttctttt aggttgggcg 806 gtgggtctgt gtcctggtgt tgagtttctg caaatttctg ggggtgattt ctgtgactct 866 gggcccacag cggggaggcc aagaggggcc ctgtggactt tcacccagca ctgtgggggc 926 cttcagactc tggggcagca gacatgctgc ttcccatcag ccagaggggg tcagggctgc 986 cctgttgcca aacaactccc tgaggcctct ccgcaccacc tcagcgggca ggaggtccca 1046 ccatgtggac agacatagcc caaggaggca ccacaggtct atgtgtgctg ggggatgtca 1106 ggtgccaccc aacgctgtcc tggtggtatt tacaatgaca tcctcctcct ccatcactcc 1166 aggggtggtg tctcggccgc ccctaccagc tggctgagcc ccctggcctc ctgcgctccc 1226 tcacttccct cagttcccaa agctgcccag tccatgggga cagaaccgtc actcagatcc 1286 acattcaagt gtgcccaccc tgcagtcttc atcctcactc agctgctgcc tctggaggtg 1346 cctttggcca catgtgctgt gctgtttgtc tcctcgacag ggagcctgtc caccagcagg 1406 ctgcggtccc agcgggtgcg tctgcagctc ctccccttgg gcagcctggt tctcccggag 1466 gacctttcct tggggccctg cttcatgacg atgctgcctg tgtcaccctc taccatctgt 1526 aaacaactgg gtgccttccc cgaccacacc ccaatgcctt cccagcttgg aagccaaggc 1586 agctgatgaa gggagctcag gagagccgtc ttcagctggg ctggggttgg ggctgctgtg 1646 aggaaaacct gccattgtgg ccctggagag tcaccagcag ctcttgggaa ggacttgctg 1706 ggaggctgag agaggctttg ggcacagcct gctgtctttt ccatttccta aagtttactt 1766 cattgtcttg aggcttccag gttttgtttt tgtttttgcc aaagtagaaa aggcaggtgg 1826 tgggcggctg gcagggagtg cgggtccccg cccctcttca gtcctgccct cccctcctca 1886 gtcctgccca ccccgtgcag cccatgctga ggctgcagtg gtgtcgtggg tgttacgtgc 1946 aggaacgtgg agaccctgac gtgggctcac tgcgtttggt tttcttttca gaacttggga 2006 gcccccaggg aggggctagt gttggtaggt cctagacgtg gttccctcca gcctccccaa 2066 aatcaaccct ggtgttgaga gaacgtcctt ctgtccatcg tgggtaacag ccttggggag 2126 ggtgcagagc tctgcagagc catgggccag gtggggctgc ctcagtcctg tccccttggg 2186 cactgaggag aggggcccat tcacctttct cctagaatgc tgttgtaaat aaacaaatgg 2246 atccctgg 2254 18 196 PRT Homo sapiens 18 Met Asp Asp Thr Leu Phe Gln Leu Lys Phe Thr Ala Lys Gln Leu Glu 1 5 10 15 Lys Leu Ala Lys Lys Ala Glu Lys Asp Ser Lys Ala Glu Gln Ala Lys 20 25 30 Val Lys Lys Ala Leu Leu Gln Lys Asn Val Glu Cys Ala Arg Val Tyr 35 40 45 Ala Glu Asn Ala Ile Arg Lys Lys Asn Glu Gly Val Asn Trp Leu Arg 50 55 60 Met Ala Ser Arg Val Asp Ala Val Ala Ser Lys Val Gln Thr Ala Val 65 70 75 80 Thr Met Lys Gly Val Thr Lys Asn Met Ala Gln Val Thr Lys Ala Leu 85 90 95 Asp Lys Ala Leu Ser Thr Met Asp Leu Gln Lys Val Ser Ser Val Met 100 105 110 Asp Arg Phe Glu Gln Gln Val Gln Asn Leu Asp Val His Thr Ser Val 115 120 125 Met Glu Asp Ser Met Ser Ser Ala Thr Thr Leu Thr Thr Pro Gln Glu 130 135 140 Gln Val Asp Ser Leu Ile Met Gln Ile Ala Glu Glu Asn Gly Leu Glu 145 150 155 160 Val Leu Asp Gln Leu Ser Gln Leu Pro Glu Gly Ala Ser Ala Val Gly 165 170 175 Glu Ser Ser Val Arg Ser Gln Glu Asp Gln Leu Ser Arg Arg Leu Ala 180 185 190 Ala Leu Arg Asn 195 19 2136 DNA Homo sapiens CDS (153)..(1562) 19 gcagagcggc ggcttctctc gcgaggacgg acgccattat cgcatctccc cgacaaacac 60 cacgagaatt ccgcagccca cacggtgacc aaaagccagc cccactgtga gttgaactct 120 ttcgtgttga ccggccactc tccgtgctct gg atg atg tcg gaa cac gac ctg 173 Met Met Ser Glu His Asp Leu 1 5 gcc gat gtg gtt cag att gca gtg gaa gac ctg agc cct gac cac cca 221 Ala Asp Val Val Gln Ile Ala Val Glu Asp Leu Ser Pro Asp His Pro 10 15 20 gtt gtt ttg gag aat cat gta gtg aca gat gaa gac gaa cct gct ttg 269 Val Val Leu Glu Asn His Val Val Thr Asp Glu Asp Glu Pro Ala Leu 25 30 35 aaa cgc cag cga cta gaa atc aat tgc cag gat cca tct ata aag tca 317 Lys Arg Gln Arg Leu Glu Ile Asn Cys Gln Asp Pro Ser Ile Lys Ser 40 45 50 55 ttc ctg tat tcc atc aac cag aca atc tgc ttg cgg ttg gat agc att 365 Phe Leu Tyr Ser Ile Asn Gln Thr Ile Cys Leu Arg Leu Asp Ser Ile 60 65 70 gaa gcc aaa ttg caa gcc ctg gag gct act tgt aaa tcc tta gaa gaa 413 Glu Ala Lys Leu Gln Ala Leu Glu Ala Thr Cys Lys Ser Leu Glu Glu 75 80 85 aag ctg gat ctg gtc acg aac aag cag cac agc ccc atc cag gtc ccc 461 Lys Leu Asp Leu Val Thr Asn Lys Gln His Ser Pro Ile Gln Val Pro 90 95 100 atg gtg gcc ggc tcc cct ctc ggg gca acc cag acg tgc aac aaa gtg 509 Met Val Ala Gly Ser Pro Leu Gly Ala Thr Gln Thr Cys Asn Lys Val 105 110 115 cga tgc gct gtg cct ggg cgt cgg cag aac acc att gtg gtg aag gtg 557 Arg Cys Ala Val Pro Gly Arg Arg Gln Asn Thr Ile Val Val Lys Val 120 125 130 135 ccg ggc caa gaa gac agc cac cac gag gac ggg gag agc ggc tcg gag 605 Pro Gly Gln Glu Asp Ser His His Glu Asp Gly Glu Ser Gly Ser Glu 140 145 150 gcc agc gac tct gtg tcc agc tgt ggg cag gcg ggc agt cag agc atc 653 Ala Ser Asp Ser Val Ser Ser Cys Gly Gln Ala Gly Ser Gln Ser Ile 155 160 165 ggg agc aac gtc acg ctc atc acc ctg aac tcg gaa gag gac tac ccc 701 Gly Ser Asn Val Thr Leu Ile Thr Leu Asn Ser Glu Glu Asp Tyr Pro 170 175 180 aat ggc acc tgg ctg ggc gac gag aac aac ccc gag atg cgg gta cgc 749 Asn Gly Thr Trp Leu Gly Asp Glu Asn Asn Pro Glu Met Arg Val Arg 185 190 195 tgc gcc atc atc ccc tcc gac atg ctg cac atc agc acc aac tgc cgc 797 Cys Ala Ile Ile Pro Ser Asp Met Leu His Ile Ser Thr Asn Cys Arg 200 205 210 215 acg gcc gag aag atg gcg cta acg ctg ctg gac tac ctc ttc cac cgc 845 Thr Ala Glu Lys Met Ala Leu Thr Leu Leu Asp Tyr Leu Phe His Arg 220 225 230 gag gtg cag gct gtg tcc aac ctc tcg ggg cag ggc aag cac ggg aag 893 Glu Val Gln Ala Val Ser Asn Leu Ser Gly Gln Gly Lys His Gly Lys 235 240 245 aag cag ctg gac ccg ctc acc atc tac ggc atc cgg tgt cac ctt ttc 941 Lys Gln Leu Asp Pro Leu Thr Ile Tyr Gly Ile Arg Cys His Leu Phe 250 255 260 tat aaa ttt ggc atc aca gaa tcc gac tgg tac cga atc aag cag agc 989 Tyr Lys Phe Gly Ile Thr Glu Ser Asp Trp Tyr Arg Ile Lys Gln Ser 265 270 275 atc gac tcc aag tgc cgc acg gcg tgg cgg cgc aag cag cgg ggc cag 1037 Ile Asp Ser Lys Cys Arg Thr Ala Trp Arg Arg Lys Gln Arg Gly Gln 280 285 290 295 agc ctg gcg gtc aag agc ttc tcg cgg aga acg ccc aac tcg tcc tcc 1085 Ser Leu Ala Val Lys Ser Phe Ser Arg Arg Thr Pro Asn Ser Ser Ser 300 305 310 tac tgc cct tca gag ccg atg atg agc acc cca cct cct gcc agc gag 1133 Tyr Cys Pro Ser Glu Pro Met Met Ser Thr Pro Pro Pro Ala Ser Glu 315 320 325 ctc ccg cag cca cag ccg cag ccg cag gcc ctg cac tac gcg ctg gcc 1181 Leu Pro Gln Pro Gln Pro Gln Pro Gln Ala Leu His Tyr Ala Leu Ala 330 335 340 aac gca cag cag gtg cag atc cac cag atc gga gaa gac gga cag gtg 1229 Asn Ala Gln Gln Val Gln Ile His Gln Ile Gly Glu Asp Gly Gln Val 345 350 355 caa gta atc cca cag gga cac ctc cac atc gcc cag gtg ccg cag ggg 1277 Gln Val Ile Pro Gln Gly His Leu His Ile Ala Gln Val Pro Gln Gly 360 365 370 375 gag caa gtc cag atc acg cag gac agc gag ggc aac ctc cag atc cat 1325 Glu Gln Val Gln Ile Thr Gln Asp Ser Glu Gly Asn Leu Gln Ile His 380 385 390 cac gtg ggg cag gac ggt cag gtg ctg cag ggt gca cag ctg atc gcc 1373 His Val Gly Gln Asp Gly Gln Val Leu Gln Gly Ala Gln Leu Ile Ala 395 400 405 gtg gcc tcc tcg gac ccc gcg gcg gcg ggc gtg gat ggg tcg cca ctc 1421 Val Ala Ser Ser Asp Pro Ala Ala Ala Gly Val Asp Gly Ser Pro Leu 410 415 420 cag ggc agc gac atc cag gtt cag tac gtg cag ctg gcg cca gtg agt 1469 Gln Gly Ser Asp Ile Gln Val Gln Tyr Val Gln Leu Ala Pro Val Ser 425 430 435 gac cac acg gcc ggg gca cag acg gcc gaa gcc ctg cag ccc acg cta 1517 Asp His Thr Ala Gly Ala Gln Thr Ala Glu Ala Leu Gln Pro Thr Leu 440 445 450 455 cag ccg gag atg cag ctc gag cac ggg gcc atc cag att cag tga 1562 Gln Pro Glu Met Gln Leu Glu His Gly Ala Ile Gln Ile Gln 460 465 gcggtgccca tggcaccagg agcccctcgc cggctccgcc tacggcccgg cccccacgcg 1622 ccctgctctc acggcctcgg cacaggcagc ggctgcacgt gttctgctga agtgcgtctg 1682 aaggccgctg cctccgcggg gaacagcatc ctatgaactg aaagagcagc cgccgccgcc 1742 cccagccgga gacccctttc gtttgagtcc tgctgttggt gtcggagcac gaggggaggc 1802 acggtgcgga gagcgtcgca tatgcgcggg aaatcaagaa ctatgatatt tttctgttta 1862 aacagctttt tttaatttgc tatggtgttt ataacaaaaa agaaaatttg aaaaaaaaaa 1922 tcccagggga gtagcaggag ccctttgctg tgtgctctgt ccagtgtcat gagacgggag 1982 ccctttgctg tgtgctctgt ccagtgtcat gaggcaggtg tttgcaaagc cagctctcgg 2042 ttccgatggg gtattgctga cctacttttc taggggaaat gctcttaaac actgtaatta 2102 tgcatttcta atgaaataaa atgtatttat gacc 2136 20 469 PRT Homo sapiens 20 Met Met Ser Glu His Asp Leu Ala Asp Val Val Gln Ile Ala Val Glu 1 5 10 15 Asp Leu Ser Pro Asp His Pro Val Val Leu Glu Asn His Val Val Thr 20 25 30 Asp Glu Asp Glu Pro Ala Leu Lys Arg Gln Arg Leu Glu Ile Asn Cys 35 40 45 Gln Asp Pro Ser Ile Lys Ser Phe Leu Tyr Ser Ile Asn Gln Thr Ile 50 55 60 Cys Leu Arg Leu Asp Ser Ile Glu Ala Lys Leu Gln Ala Leu Glu Ala 65 70 75 80 Thr Cys Lys Ser Leu Glu Glu Lys Leu Asp Leu Val Thr Asn Lys Gln 85 90 95 His Ser Pro Ile Gln Val Pro Met Val Ala Gly Ser Pro Leu Gly Ala 100 105 110 Thr Gln Thr Cys Asn Lys Val Arg Cys Ala Val Pro Gly Arg Arg Gln 115 120 125 Asn Thr Ile Val Val Lys Val Pro Gly Gln Glu Asp Ser His His Glu 130 135 140 Asp Gly Glu Ser Gly Ser Glu Ala Ser Asp Ser Val Ser Ser Cys Gly 145 150 155 160 Gln Ala Gly Ser Gln Ser Ile Gly Ser Asn Val Thr Leu Ile Thr Leu 165 170 175 Asn Ser Glu Glu Asp Tyr Pro Asn Gly Thr Trp Leu Gly Asp Glu Asn 180 185 190 Asn Pro Glu Met Arg Val Arg Cys Ala Ile Ile Pro Ser Asp Met Leu 195 200 205 His Ile Ser Thr Asn Cys Arg Thr Ala Glu Lys Met Ala Leu Thr Leu 210 215 220 Leu Asp Tyr Leu Phe His Arg Glu Val Gln Ala Val Ser Asn Leu Ser 225 230 235 240 Gly Gln Gly Lys His Gly Lys Lys Gln Leu Asp Pro Leu Thr Ile Tyr 245 250 255 Gly Ile Arg Cys His Leu Phe Tyr Lys Phe Gly Ile Thr Glu Ser Asp 260 265 270 Trp Tyr Arg Ile Lys Gln Ser Ile Asp Ser Lys Cys Arg Thr Ala Trp 275 280 285 Arg Arg Lys Gln Arg Gly Gln Ser Leu Ala Val Lys Ser Phe Ser Arg 290 295 300 Arg Thr Pro Asn Ser Ser Ser Tyr Cys Pro Ser Glu Pro Met Met Ser 305 310 315 320 Thr Pro Pro Pro Ala Ser Glu Leu Pro Gln Pro Gln Pro Gln Pro Gln 325 330 335 Ala Leu His Tyr Ala Leu Ala Asn Ala Gln Gln Val Gln Ile His Gln 340 345 350 Ile Gly Glu Asp Gly Gln Val Gln Val Ile Pro Gln Gly His Leu His 355 360 365 Ile Ala Gln Val Pro Gln Gly Glu Gln Val Gln Ile Thr Gln Asp Ser 370 375 380 Glu Gly Asn Leu Gln Ile His His Val Gly Gln Asp Gly Gln Val Leu 385 390 395 400 Gln Gly Ala Gln Leu Ile Ala Val Ala Ser Ser Asp Pro Ala Ala Ala 405 410 415 Gly Val Asp Gly Ser Pro Leu Gln Gly Ser Asp Ile Gln Val Gln Tyr 420 425 430 Val Gln Leu Ala Pro Val Ser Asp His Thr Ala Gly Ala Gln Thr Ala 435 440 445 Glu Ala Leu Gln Pro Thr Leu Gln Pro Glu Met Gln Leu Glu His Gly 450 455 460 Ala Ile Gln Ile Gln 465 21 2127 DNA Homo sapiens CDS (153)..(1556) 21 gcagagcggc ggcttctctc gcgaggacgg acgccattat cgcatctccc cgacaaacac 60 cacgagaatt ccgcagccca cacggtgacc aaaagccagc cccactgtga gttgaactct 120 ttcgtgttga ccggccactc tccgtgctct gg atg atg tcg gaa cac gac ctg 173 Met Met Ser Glu His Asp Leu 1 5 gcc gat gtg gtt cag att gca gtg gaa gac ctg agc cct gac cac cca 221 Ala Asp Val Val Gln Ile Ala Val Glu Asp Leu Ser Pro Asp His Pro 10 15 20 gtt gtt ttg gag aat cat gta gtg aca gat gaa gac gaa cct gct ttg 269 Val Val Leu Glu Asn His Val Val Thr Asp Glu Asp Glu Pro Ala Leu 25 30 35 aaa cgc cag cga cta gaa atc aat tgc cag gat cca tct ata aag tca 317 Lys Arg Gln Arg Leu Glu Ile Asn Cys Gln Asp Pro Ser Ile Lys Ser 40 45 50 55 ttc ctg tat tcc atc aac cag aca atc tgc ttg cgg ttg gat agc att 365 Phe Leu Tyr Ser Ile Asn Gln Thr Ile Cys Leu Arg Leu Asp Ser Ile 60 65 70 gaa gcc aaa ttg caa gcc ctg gag gct act tgt aaa tcc tta gaa gaa 413 Glu Ala Lys Leu Gln Ala Leu Glu Ala Thr Cys Lys Ser Leu Glu Glu 75 80 85 aag ctg gat ctg gtc acg aac aag cag cac agc ccc atc cag gtc ccc 461 Lys Leu Asp Leu Val Thr Asn Lys Gln His Ser Pro Ile Gln Val Pro 90 95 100 atg gtg gcc ggc tcc cct ctc ggg gca acc cag acg tgc aac aaa gtg 509 Met Val Ala Gly Ser Pro Leu Gly Ala Thr Gln Thr Cys Asn Lys Val 105 110 115 cga tgc gct gtg cct ggg cgt cgg cag aac acc att gtg gtg aag gtg 557 Arg Cys Ala Val Pro Gly Arg Arg Gln Asn Thr Ile Val Val Lys Val 120 125 130 135 ccg ggc caa gaa gac agc cac cac gag gac ggg gag agc ggc tcg gag 605 Pro Gly Gln Glu Asp Ser His His Glu Asp Gly Glu Ser Gly Ser Glu 140 145 150 gcc agc gac tct gtg tcc agc tgt ggg cag gcg ggc agt cag agc atc 653 Ala Ser Asp Ser Val Ser Ser Cys Gly Gln Ala Gly Ser Gln Ser Ile 155 160 165 ggg agc aac gtc acg ctc atc acc ctg aac tcg gaa gag gac tac ccc 701 Gly Ser Asn Val Thr Leu Ile Thr Leu Asn Ser Glu Glu Asp Tyr Pro 170 175 180 aat ggc acc tgg ctg ggc gac gag aac aac ccc gag atg cgg gta cgc 749 Asn Gly Thr Trp Leu Gly Asp Glu Asn Asn Pro Glu Met Arg Val Arg 185 190 195 tgc gcc atc atc ccc tcc gac atg ctg cac atc agc acc aac tgc cgc 797 Cys Ala Ile Ile Pro Ser Asp Met Leu His Ile Ser Thr Asn Cys Arg 200 205 210 215 acg gcc gag aag atg gcg cta acg ctg ctg gac tac ctc ttc cac cgc 845 Thr Ala Glu Lys Met Ala Leu Thr Leu Leu Asp Tyr Leu Phe His Arg 220 225 230 gag gtg cag gct gtg tcc aac ctc tcg ggg cag ggc aag cac ggg aag 893 Glu Val Gln Ala Val Ser Asn Leu Ser Gly Gln Gly Lys His Gly Lys 235 240 245 aag cag ctg gac ccg ctc acc atc tac ggc atc cgg tgt cac ctt ttc 941 Lys Gln Leu Asp Pro Leu Thr Ile Tyr Gly Ile Arg Cys His Leu Phe 250 255 260 tat aaa ttt ggc atc aca gaa tcc gac tgg tac cga atc aag cag agc 989 Tyr Lys Phe Gly Ile Thr Glu Ser Asp Trp Tyr Arg Ile Lys Gln Ser 265 270 275 atc gac tcc aag tgc cgc acg gcg tgg cgg cgc aag cag cgg ggc cag 1037 Ile Asp Ser Lys Cys Arg Thr Ala Trp Arg Arg Lys Gln Arg Gly Gln 280 285 290 295 agc ctg gcg gtc aag agc ttc tcg cgg aga acg ccc aac tcg tcc tcc 1085 Ser Leu Ala Val Lys Ser Phe Ser Arg Arg Thr Pro Asn Ser Ser Ser 300 305 310 tac tgc cct tca gag ccg atg atg agc acc cca cct cct gcc agc gag 1133 Tyr Cys Pro Ser Glu Pro Met Met Ser Thr Pro Pro Pro Ala Ser Glu 315 320 325 ctc ccg cag cca cag ccg cag ccg cag gcc ctg cac tac gcg ctg gcc 1181 Leu Pro Gln Pro Gln Pro Gln Pro Gln Ala Leu His Tyr Ala Leu Ala 330 335 340 aac gca cag cag gtg cag atc cac cag atc gga gaa gac gga cag gtg 1229 Asn Ala Gln Gln Val Gln Ile His Gln Ile Gly Glu Asp Gly Gln Val 345 350 355 caa gta gga cac ctc cac atc gcc cag gtg ccg cag ggg gag caa gtc 1277 Gln Val Gly His Leu His Ile Ala Gln Val Pro Gln Gly Glu Gln Val 360 365 370 375 cag atc acg cag gac agc gag ggc aac ctc cag atc cat cac gtg ggg 1325 Gln Ile Thr Gln Asp Ser Glu Gly Asn Leu Gln Ile His His Val Gly 380 385 390 cag gac ggt cag gtg ctg cag ggt gca cag ctg atc gcc gtg gcc tcc 1373 Gln Asp Gly Gln Val Leu Gln Gly Ala Gln Leu Ile Ala Val Ala Ser 395 400 405 tcg gac ccc gcg gcg gcg ggc gtg gat ggg tcg cca ctc cag ggc agc 1421 Ser Asp Pro Ala Ala Ala Gly Val Asp Gly Ser Pro Leu Gln Gly Ser 410 415 420 gac atc cag gtt cag tac gtg cag ctg gcg cca gtg agt gac cac acg 1469 Asp Ile Gln Val Gln Tyr Val Gln Leu Ala Pro Val Ser Asp His Thr 425 430 435 gcc ggg gca cag acg gcc gaa gcc ctg cag ccc acg cta cag ccg gag 1517 Ala Gly Ala Gln Thr Ala Glu Ala Leu Gln Pro Thr Leu Gln Pro Glu 440 445 450 455 atg cag ctc gag cac ggg gcc atc cag att cag tga gcg gtgcccatgg 1566 Met Gln Leu Glu His Gly Ala Ile Gln Ile Gln Ala 460 465 caccaggagc ccctcgccgg ctccgcctac ggcccggccc ccacgcgccc tgctctcacg 1626 gcctcggcac aggcagcggc tgcacgtgtt ctgctgaagt gcgtctgaag gccgctgcct 1686 ccgcggggaa cagcatccta tgaactgaaa gagcagccgc cgccgccccc agccggagac 1746 ccctttcgtt tgagtcctgc tgttggtgtc ggagcacgag gggaggcacg gtgcggagag 1806 cgtcgcatat gcgcgggaaa tcaagaacta tgatattttt ctgtttaaac agcttttttt 1866 aatttgctat ggtgtttata acaaaaaaga aaatttgaaa aaaaaaatcc caggggagta 1926 gcaggagccc tttgctgtgt gctctgtcca gtgtcatgag acgggagccc tttgctgtgt 1986 gctctgtcca gtgtcatgag gcaggtgttt gcaaagccag ctctcggttc cgatggggta 2046 ttgctgacct acttttctag gggaaatgct cttaaacact gtaattatgc atttctaatg 2106 aaataaaatg tatttatgac c 2127 22 466 PRT Homo sapiens 22 Met Met Ser Glu His Asp Leu Ala Asp Val Val Gln Ile Ala Val Glu 1 5 10 15 Asp Leu Ser Pro Asp His Pro Val Val Leu Glu Asn His Val Val Thr 20 25 30 Asp Glu Asp Glu Pro Ala Leu Lys Arg Gln Arg Leu Glu Ile Asn Cys 35 40 45 Gln Asp Pro Ser Ile Lys Ser Phe Leu Tyr Ser Ile Asn Gln Thr Ile 50 55 60 Cys Leu Arg Leu Asp Ser Ile Glu Ala Lys Leu Gln Ala Leu Glu Ala 65 70 75 80 Thr Cys Lys Ser Leu Glu Glu Lys Leu Asp Leu Val Thr Asn Lys Gln 85 90 95 His Ser Pro Ile Gln Val Pro Met Val Ala Gly Ser Pro Leu Gly Ala 100 105 110 Thr Gln Thr Cys Asn Lys Val Arg Cys Ala Val Pro Gly Arg Arg Gln 115 120 125 Asn Thr Ile Val Val Lys Val Pro Gly Gln Glu Asp Ser His His Glu 130 135 140 Asp Gly Glu Ser Gly Ser Glu Ala Ser Asp Ser Val Ser Ser Cys Gly 145 150 155 160 Gln Ala Gly Ser Gln Ser Ile Gly Ser Asn Val Thr Leu Ile Thr Leu 165 170 175 Asn Ser Glu Glu Asp Tyr Pro Asn Gly Thr Trp Leu Gly Asp Glu Asn 180 185 190 Asn Pro Glu Met Arg Val Arg Cys Ala Ile Ile Pro Ser Asp Met Leu 195 200 205 His Ile Ser Thr Asn Cys Arg Thr Ala Glu Lys Met Ala Leu Thr Leu 210 215 220 Leu Asp Tyr Leu Phe His Arg Glu Val Gln Ala Val Ser Asn Leu Ser 225 230 235 240 Gly Gln Gly Lys His Gly Lys Lys Gln Leu Asp Pro Leu Thr Ile Tyr 245 250 255 Gly Ile Arg Cys His Leu Phe Tyr Lys Phe Gly Ile Thr Glu Ser Asp 260 265 270 Trp Tyr Arg Ile Lys Gln Ser Ile Asp Ser Lys Cys Arg Thr Ala Trp 275 280 285 Arg Arg Lys Gln Arg Gly Gln Ser Leu Ala Val Lys Ser Phe Ser Arg 290 295 300 Arg Thr Pro Asn Ser Ser Ser Tyr Cys Pro Ser Glu Pro Met Met Ser 305 310 315 320 Thr Pro Pro Pro Ala Ser Glu Leu Pro Gln Pro Gln Pro Gln Pro Gln 325 330 335 Ala Leu His Tyr Ala Leu Ala Asn Ala Gln Gln Val Gln Ile His Gln 340 345 350 Ile Gly Glu Asp Gly Gln Val Gln Val Gly His Leu His Ile Ala Gln 355 360 365 Val Pro Gln Gly Glu Gln Val Gln Ile Thr Gln Asp Ser Glu Gly Asn 370 375 380 Leu Gln Ile His His Val Gly Gln Asp Gly Gln Val Leu Gln Gly Ala 385 390 395 400 Gln Leu Ile Ala Val Ala Ser Ser Asp Pro Ala Ala Ala Gly Val Asp 405 410 415 Gly Ser Pro Leu Gln Gly Ser Asp Ile Gln Val Gln Tyr Val Gln Leu 420 425 430 Ala Pro Val Ser Asp His Thr Ala Gly Ala Gln Thr Ala Glu Ala Leu 435 440 445 Gln Pro Thr Leu Gln Pro Glu Met Gln Leu Glu His Gly Ala Ile Gln 450 455 460 Ile Gln 465 23 794 DNA Homo sapiens CDS (71)..(664) 23 gtagcggggc ggccgggcgg atccagcgca gccgggagac agatgcgagg cggcggtcag 60 gtgggatgct atg gaa tat gat gag aag ctg gcc cgt ttc cgg cag gcc 109 Met Glu Tyr Asp Glu Lys Leu Ala Arg Phe Arg Gln Ala 1 5 10 cac ctc aac ccc ttc aac aag cag tct ggg ccg aga cag cat gag cag 157 His Leu Asn Pro Phe Asn Lys Gln Ser Gly Pro Arg Gln His Glu Gln 15 20 25 ggc cct ggg gag gag gtc ccg gac gtc act cct gaa gag gcc ctg cct 205 Gly Pro Gly Glu Glu Val Pro Asp Val Thr Pro Glu Glu Ala Leu Pro 30 35 40 45 gag ctg ccc cct ggg gag ccg gaa ttc cgc tgc cct gaa cgc gtg atg 253 Glu Leu Pro Pro Gly Glu Pro Glu Phe Arg Cys Pro Glu Arg Val Met 50 55 60 gat ctc ggc ctg tct gag gac cac ttc tcc cgc cct gtg ggt ctg ttc 301 Asp Leu Gly Leu Ser Glu Asp His Phe Ser Arg Pro Val Gly Leu Phe 65 70 75 ctg gcc tct gac gtc cag cag ctg cgg cag gcg atc gag gag tgc aag 349 Leu Ala Ser Asp Val Gln Gln Leu Arg Gln Ala Ile Glu Glu Cys Lys 80 85 90 cag gtg att ctg gag ctg ccc gag cag tcg gag aag cag aag gat gcc 397 Gln Val Ile Leu Glu Leu Pro Glu Gln Ser Glu Lys Gln Lys Asp Ala 95 100 105 gtg gtg cga ctc atc cac ctc cgg ctg aag ctc cag gag ctg aag gac 445 Val Val Arg Leu Ile His Leu Arg Leu Lys Leu Gln Glu Leu Lys Asp 110 115 120 125 ccc aat gag gat gag cca aac atc cga gtg ctc ctt gag cac cgc ttt 493 Pro Asn Glu Asp Glu Pro Asn Ile Arg Val Leu Leu Glu His Arg Phe 130 135 140 tac aag gag aag agc aag agc gtc aag cag acc tgt gac aag tgt aac 541 Tyr Lys Glu Lys Ser Lys Ser Val Lys Gln Thr Cys Asp Lys Cys Asn 145 150 155 acc atc atc tgg ggg ctc att cag acc tgg tac acc tgc aca ggt ggg 589 Thr Ile Ile Trp Gly Leu Ile Gln Thr Trp Tyr Thr Cys Thr Gly Gly 160 165 170 ccg aga ccc aga cga gga gtg agg aat gag aga gac caa agt tcc tgc 637 Pro Arg Pro Arg Arg Gly Val Arg Asn Glu Arg Asp Gln Ser Ser Cys 175 180 185 ctc cgc tgg gct cac att cag atg tga gctatccagt gtggtggcca 684 Leu Arg Trp Ala His Ile Gln Met 190 195 ctagccaaac atggccagtc aaagttaaat taactaaata tgcatttcct cgttgtactg 744 tctacattgg acagggctgt tctggatgag aaattaaact agtgaattac 794 24 197 PRT Homo sapiens 24 Met Glu Tyr Asp Glu Lys Leu Ala Arg Phe Arg Gln Ala His Leu Asn 1 5 10 15 Pro Phe Asn Lys Gln Ser Gly Pro Arg Gln His Glu Gln Gly Pro Gly 20 25 30 Glu Glu Val Pro Asp Val Thr Pro Glu Glu Ala Leu Pro Glu Leu Pro 35 40 45 Pro Gly Glu Pro Glu Phe Arg Cys Pro Glu Arg Val Met Asp Leu Gly 50 55 60 Leu Ser Glu Asp His Phe Ser Arg Pro Val Gly Leu Phe Leu Ala Ser 65 70 75 80 Asp Val Gln Gln Leu Arg Gln Ala Ile Glu Glu Cys Lys Gln Val Ile 85 90 95 Leu Glu Leu Pro Glu Gln Ser Glu Lys Gln Lys Asp Ala Val Val Arg 100 105 110 Leu Ile His Leu Arg Leu Lys Leu Gln Glu Leu Lys Asp Pro Asn Glu 115 120 125 Asp Glu Pro Asn Ile Arg Val Leu Leu Glu His Arg Phe Tyr Lys Glu 130 135 140 Lys Ser Lys Ser Val Lys Gln Thr Cys Asp Lys Cys Asn Thr Ile Ile 145 150 155 160 Trp Gly Leu Ile Gln Thr Trp Tyr Thr Cys Thr Gly Gly Pro Arg Pro 165 170 175 Arg Arg Gly Val Arg Asn Glu Arg Asp Gln Ser Ser Cys Leu Arg Trp 180 185 190 Ala His Ile Gln Met 195 25 3525 DNA Homo sapiens CDS (71)..(1426) 25 gtagcggggc ggccgggcgg atccagcgca gccgggagac agatgcgagg cggcggtcag 60 gtgggatgct atg gaa tat gat gag aag ctg gcc cgt ttc cgg cag gcc 109 Met Glu Tyr Asp Glu Lys Leu Ala Arg Phe Arg Gln Ala 1 5 10 cac ctc aac ccc ttc aac aag cag tct ggg ccg aga cag cat gag cag 157 His Leu Asn Pro Phe Asn Lys Gln Ser Gly Pro Arg Gln His Glu Gln 15 20 25 ggc cct ggg gag gag gtc ccg gac gtc act cct gaa gag gcc ctg cct 205 Gly Pro Gly Glu Glu Val Pro Asp Val Thr Pro Glu Glu Ala Leu Pro 30 35 40 45 gag ctg ccc cct ggg gag ccg gaa ttc cgc tgc cct gaa cgc gtg atg 253 Glu Leu Pro Pro Gly Glu Pro Glu Phe Arg Cys Pro Glu Arg Val Met 50 55 60 gat ctc ggc ctg tct gag gac cac ttc tcc cgc cct gtg ggt ctg ttc 301 Asp Leu Gly Leu Ser Glu Asp His Phe Ser Arg Pro Val Gly Leu Phe 65 70 75 ctg gcc tct gac gtc cag cag ctg cgg cag gcg atc gag gag tgc aag 349 Leu Ala Ser Asp Val Gln Gln Leu Arg Gln Ala Ile Glu Glu Cys Lys 80 85 90 cag gtg att ctg gag ctg ccc gag cag tcg gag aag cag aag gat gcc 397 Gln Val Ile Leu Glu Leu Pro Glu Gln Ser Glu Lys Gln Lys Asp Ala 95 100 105 gtg gtg cga ctc atc cac ctc cgg ctg aag ctc cag gag ctg aag gac 445 Val Val Arg Leu Ile His Leu Arg Leu Lys Leu Gln Glu Leu Lys Asp 110 115 120 125 ccc aat gag gat gag cca aac atc cga gtg ctc ctt gag cac cgc ttt 493 Pro Asn Glu Asp Glu Pro Asn Ile Arg Val Leu Leu Glu His Arg Phe 130 135 140 tac aag gag aag agc aag agc gtc aag cag acc tgt gac aag tgt aac 541 Tyr Lys Glu Lys Ser Lys Ser Val Lys Gln Thr Cys Asp Lys Cys Asn 145 150 155 acc atc atc tgg ggg ctc att cag acc tgg tac acc tgc aca ggg tgt 589 Thr Ile Ile Trp Gly Leu Ile Gln Thr Trp Tyr Thr Cys Thr Gly Cys 160 165 170 tat tac cgc tgt cac agt aag tgc ttg aac ctc atc tcc aag ccc tgt 637 Tyr Tyr Arg Cys His Ser Lys Cys Leu Asn Leu Ile Ser Lys Pro Cys 175 180 185 gtg agc tcc aaa gtc agc cac caa gct gaa tac gaa ctg aac atc tgc 685 Val Ser Ser Lys Val Ser His Gln Ala Glu Tyr Glu Leu Asn Ile Cys 190 195 200 205 cct gag aca ggg ctg gac agc cag gat tac cgc tgt gcc gag tgc cgg 733 Pro Glu Thr Gly Leu Asp Ser Gln Asp Tyr Arg Cys Ala Glu Cys Arg 210 215 220 gcg ccc atc tct ctg cgg ggt gtg ccc agt gag gcc agg cag tgc gac 781 Ala Pro Ile Ser Leu Arg Gly Val Pro Ser Glu Ala Arg Gln Cys Asp 225 230 235 tac acc ggc cag tac tac tgc agc cac tgc cac tgg aac gac ctg gct 829 Tyr Thr Gly Gln Tyr Tyr Cys Ser His Cys His Trp Asn Asp Leu Ala 240 245 250 gtg atc cct gca cgc gtt gta cac aac tgg gac ttt gag cct cga aag 877 Val Ile Pro Ala Arg Val Val His Asn Trp Asp Phe Glu Pro Arg Lys 255 260 265 gtt tct cgc tgc agc atg cgc tac ctg gcg ctg atg gtg tct cgg ccc 925 Val Ser Arg Cys Ser Met Arg Tyr Leu Ala Leu Met Val Ser Arg Pro 270 275 280 285 gta ctc agg ctc cgg gag atc aac cct ctg ctg ttc agc tac gtg gag 973 Val Leu Arg Leu Arg Glu Ile Asn Pro Leu Leu Phe Ser Tyr Val Glu 290 295 300 gag ctg gtg gag att cgc aag ctg cgc cag gac atc ctg ctc atg aag 1021 Glu Leu Val Glu Ile Arg Lys Leu Arg Gln Asp Ile Leu Leu Met Lys 305 310 315 ccg tac ttc atc acc tgc agg gag gcc atg gag gct cgt ctg ctg ctg 1069 Pro Tyr Phe Ile Thr Cys Arg Glu Ala Met Glu Ala Arg Leu Leu Leu 320 325 330 cag ctc cag gat cgg cag cat ttt gtg gag aac gac gag atg tac tct 1117 Gln Leu Gln Asp Arg Gln His Phe Val Glu Asn Asp Glu Met Tyr Ser 335 340 345 gtc cag gac ctc ctg gac gtg cat gcc ggc cgc ctg ggc tgc tcg ctc 1165 Val Gln Asp Leu Leu Asp Val His Ala Gly Arg Leu Gly Cys Ser Leu 350 355 360 365 acc gag atc cac acg ctc ttc gcc aag cac atc aag ctg gac tgc gag 1213 Thr Glu Ile His Thr Leu Phe Ala Lys His Ile Lys Leu Asp Cys Glu 370 375 380 cgg tgc cag gcc aag ggc ttc gtg tgt gag ctc tgc aga gag ggc gac 1261 Arg Cys Gln Ala Lys Gly Phe Val Cys Glu Leu Cys Arg Glu Gly Asp 385 390 395 gtg ctg ttc ccg ttc gac agc cac acg tct gtg tgc gcc gac tgc tcc 1309 Val Leu Phe Pro Phe Asp Ser His Thr Ser Val Cys Ala Asp Cys Ser 400 405 410 gcg gtc ttc cac agg gac tgc tac tac gac aac tcc acc act tgt ccc 1357 Ala Val Phe His Arg Asp Cys Tyr Tyr Asp Asn Ser Thr Thr Cys Pro 415 420 425 aag tgt gcc cgg ctc agc ctg agg aag cag tcg ctc ttc cag gag cca 1405 Lys Cys Ala Arg Leu Ser Leu Arg Lys Gln Ser Leu Phe Gln Glu Pro 430 435 440 445 ggt ccc gat gtg gag gcc tag cgccgaggaa cagtgctggg caccccgcct 1456 Gly Pro Asp Val Glu Ala 450 ggcccgccag gacccaccct gccaacatca agttgttcct tctgctccgg agacccctgg 1516 ggtgcggccc tggccccctc cacccctgct gggccagagc gggtgggcag tgtcaaggcc 1576 cgctgtctcc caggtgcttg ctgggactcg gggcggctgc acctggctgt cacctgggtg 1636 tgctgctgtg aggggtcctt gcgtggcccc catccttccc ccaatgcaga actccatggg 1696 cagggagctg gggggacatc tcacctcccc catggcacag agccctccac acccctggac 1756 cagggcatcc gggccctaga aattccacag ctcccgtcct ggccaccctg gaagctcatc 1816 aggccaagac ccggacagag cttcagagga gtgttgagtg acacctgagg atgcggctgc 1876 acacactcag ccaagggccg agtctcacct gcggtggggt ttcggctctg cctgggggct 1936 ccatcccttt cagccactcg tggccttggg gatttctggt tgtccccagc tgggactgtt 1996 cacagttgtc acctgcagac ctgcctctcc ctggcctgag gttcaaaggc ctcatcggat 2056 ggtcagtaca gtggggtcac ctgttgtttc tatacaacag cagggaaggg gccatggagc 2116 ttttccctgc tgggtgctcc tgctttggcc cagcccacct ttcctggtgc tccaagctag 2176 gaggctgtgg ccccagcctg aggagggtgt cctggcctcc aggtgtgcag caggggctgt 2236 gtgctggggg aggttccagt taggcgatgg gatcctgcag tggtctggtg gcatttcttg 2296 gaaccagatt tacctgagga gctctgtcct gctccctgtg gagggctcca gatagctcag 2356 aaatgaccag ccaatggcct tttgtttggg ggcctgaggt caagagagct gagagtattc 2416 gctcgactga gcacattcag gaagatcagg gcaggcgtgt gggaggtccc tcactccacg 2476 ggacagaggc ccctggacag cagaggaaac ctacagctct gggtgagggg acacttggct 2536 ttggtgtttg cactttacag atcctgcggt ccacgagggg cctcaggaga ggacgtgtca 2596 ggacgtggct tcccagcctt ctgccttggg cagtgggggt gctcctgtct gtccttttcc 2656 cccacacccc tggactgtgc ttggctgttg gtgcacatgg ttggcacacg gtgggcagag 2716 ggcagagaat gccactgctt ggttattggt cccctttgac caggaaaccc aagaggagac 2776 acctcagtca gcagaaaggc cacctggctc actggctcat tccaggagtg ggagagacgg 2836 cagggtctcc tctttgtcct ccggcatcag gaaggggatg gtgtccactc cccactgtgg 2896 tggctttagg caaggttctt attgtctgct ctgcctcggt ttccccatct ggaaaatggg 2956 ggcaggggtc ctgacctacc tcaggtggaa cggtgagcag ggaacatgtc ggagtccttc 3016 agagaatgtg atgtgaggtt ggatcaacag tgtgggttcc tgtcctgttt ccccttcctc 3076 tttggggctg aggaggaggt taaaggccaa atgctgtttc ccaacacccc aaagtctgca 3136 cacgtctcat gaatgcatca catttctgtc atatggatat tagccattcc gaaatctgtg 3196 taatcaactt cacattattc aagttacaaa tcactgtgtc catagaaaaa ctgtgctggt 3256 atttgctgga caaagggttg ggcccctttt atttttacct gccacccagc atctccccca 3316 catgcccctt ctgggtgaca cagccggtaa acggaatcaa cgtatggttc tttctgtggg 3376 tctgtggcac agcaggaaga gcccggtgcc gccagcacct tgtggaagac cacacatggg 3436 tggtcccaca gcatgggacc aggctggcct gagggatgcc cagttgtaac aatgctgctg 3496 tcactgtctc attaaatata catccttta 3525 26 451 PRT Homo sapiens 26 Met Glu Tyr Asp Glu Lys Leu Ala Arg Phe Arg Gln Ala His Leu Asn 1 5 10 15 Pro Phe Asn Lys Gln Ser Gly Pro Arg Gln His Glu Gln Gly Pro Gly 20 25 30 Glu Glu Val Pro Asp Val Thr Pro Glu Glu Ala Leu Pro Glu Leu Pro 35 40 45 Pro Gly Glu Pro Glu Phe Arg Cys Pro Glu Arg Val Met Asp Leu Gly 50 55 60 Leu Ser Glu Asp His Phe Ser Arg Pro Val Gly Leu Phe Leu Ala Ser 65 70 75 80 Asp Val Gln Gln Leu Arg Gln Ala Ile Glu Glu Cys Lys Gln Val Ile 85 90 95 Leu Glu Leu Pro Glu Gln Ser Glu Lys Gln Lys Asp Ala Val Val Arg 100 105 110 Leu Ile His Leu Arg Leu Lys Leu Gln Glu Leu Lys Asp Pro Asn Glu 115 120 125 Asp Glu Pro Asn Ile Arg Val Leu Leu Glu His Arg Phe Tyr Lys Glu 130 135 140 Lys Ser Lys Ser Val Lys Gln Thr Cys Asp Lys Cys Asn Thr Ile Ile 145 150 155 160 Trp Gly Leu Ile Gln Thr Trp Tyr Thr Cys Thr Gly Cys Tyr Tyr Arg 165 170 175 Cys His Ser Lys Cys Leu Asn Leu Ile Ser Lys Pro Cys Val Ser Ser 180 185 190 Lys Val Ser His Gln Ala Glu Tyr Glu Leu Asn Ile Cys Pro Glu Thr 195 200 205 Gly Leu Asp Ser Gln Asp Tyr Arg Cys Ala Glu Cys Arg Ala Pro Ile 210 215 220 Ser Leu Arg Gly Val Pro Ser Glu Ala Arg Gln Cys Asp Tyr Thr Gly 225 230 235 240 Gln Tyr Tyr Cys Ser His Cys His Trp Asn Asp Leu Ala Val Ile Pro 245 250 255 Ala Arg Val Val His Asn Trp Asp Phe Glu Pro Arg Lys Val Ser Arg 260 265 270 Cys Ser Met Arg Tyr Leu Ala Leu Met Val Ser Arg Pro Val Leu Arg 275 280 285 Leu Arg Glu Ile Asn Pro Leu Leu Phe Ser Tyr Val Glu Glu Leu Val 290 295 300 Glu Ile Arg Lys Leu Arg Gln Asp Ile Leu Leu Met Lys Pro Tyr Phe 305 310 315 320 Ile Thr Cys Arg Glu Ala Met Glu Ala Arg Leu Leu Leu Gln Leu Gln 325 330 335 Asp Arg Gln His Phe Val Glu Asn Asp Glu Met Tyr Ser Val Gln Asp 340 345 350 Leu Leu Asp Val His Ala Gly Arg Leu Gly Cys Ser Leu Thr Glu Ile 355 360 365 His Thr Leu Phe Ala Lys His Ile Lys Leu Asp Cys Glu Arg Cys Gln 370 375 380 Ala Lys Gly Phe Val Cys Glu Leu Cys Arg Glu Gly Asp Val Leu Phe 385 390 395 400 Pro Phe Asp Ser His Thr Ser Val Cys Ala Asp Cys Ser Ala Val Phe 405 410 415 His Arg Asp Cys Tyr Tyr Asp Asn Ser Thr Thr Cys Pro Lys Cys Ala 420 425 430 Arg Leu Ser Leu Arg Lys Gln Ser Leu Phe Gln Glu Pro Gly Pro Asp 435 440 445 Val Glu Ala 450 

1. A method for the diagnosis of a predisposition to cancer, in a patient, comprising the steps of: (3) establishing the level of expression of a gene selected from the group consisting of PR domain containing 7 protein (PRDM7), CDT1 DNA replication factor (CDT1), charged multivesicular body protein 1/chromoatin modifying protein 1 (CHMP1), BTG3 associated nucleoprotein (BANP), BNO224, BNO36, BNO34, BNO208, BNO230 and BNO44; and (4) comparing expression of the gene to a baseline established from expression in normal tissue controls; wherein substantial variance from the baseline indicates that the patient is susceptible to cancer.
 2. A method for determining whether a human tissue is predisposed to a neo-plastic transformation, comprising determining whether in a cell from the tissue a nucleic acid molecule selected from the group consisting of PRDM7, CDT1, CHMP1, BANP, BNO224, BNO36, BNO34, BNO208, BNO230 and BNO44 is absent, present in a mutant form or down-regulated through epigenetic mechanisms.
 3. A method according to claim 2 wherein the human tissue is human breast tissue.
 4. A method as claimed in claim 2 comprising determining whether the encoded polypeptide is absent or expressed at reduced levels.
 5. A method as claimed in claim 4 comprising contacting the cell with an antibody for binding the peptide under conditions which permit the antibody to bind the peptide if it is present.
 6. A mutant form of PRDM7, CDT1, CHMP1, BANP, BNO224, BNO36, BNO34, BNO208, BNO230 and BNO44 in which the tumour suppressor activity of the gene is compromised.
 7. A polypeptide encoded by a mutant form of a gene as defined in claim
 6. 8. An antibody to a mutant form of a gene as defined in claim
 6. 9. An isolated nucleotide molecule selected from the group consisting of nucleotide molecules having the sequence set forth in SEQ ID NO:5; SEQ ID NO:9; SEQ ID NO:23 and SEQ ID NO:25.
 10. An isolated polypeptide having the amino acid sequence set forth in any one of the group consisting of SEQ ID NO:6; SEQ ID NO:10; SEQ ID NO:12; SEQ ID NO:14; SEQ ID NO:16; SEQ ID NO:24 and SEQ ID NO:26.
 11. A polypeptide encoded by a gene comprising a nucleotide sequence set forth in any one of the group consisting of SEQ ID NO:5; SEQ ID NO:9, SEQ ID NO:11; SEQ ID NO:13; SEQ ID NO:15; SEQ ID NO:23 and SEQ ID NO:25.
 12. An antibody to a polypeptide as defined in claim
 11. 13. An antibody to a polypeptide having an amino acid sequence set forth in SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24 and SEQ ID NO:26.
 14. Use of a nucleotide molecule having the nucleotide sequence set forth in SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:11, SEQ ID NO:13, SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:21, SEQ ID NO:22 and SEQ ID NO:25 in the diagnosis of cancer or in establishing the prognosis of a patient diagnosed with cancer.
 15. Use as claimed in claim 14 wherein the cancer is breast cancer.
 16. Use of a polypeptide having an amino acid sequence set forth in SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24 and SEQ ID NO:26 in the diagnosis of cancer or in establishing the prognosis of a patient diagnosed with cancer.
 17. Use of a polypeptide as claimed in claim 11 in the diagnosis of cancer or in establishing the prognosis of a patient diagnosed with cancer.
 18. Use as claimed in either one of claims 16 or 17 wherein the cancer is breast cancer.
 19. Use of an antibody as claimed in claim 12 in the diagnosis of cancer or in establishing the prognosis of a patient diagnosed with cancer.
 20. Use of an antibody as claimed in claim 13 in the diagnosis of cancer or in establishing the prognosis of a patient diagnosed with cancer.
 21. Use as claimed in either one of claims 19 or 20 wherein the cancer is breast cancer.
 22. A microarray comprising oligonucleotides or longer fragments derived from any one or more of the genes PRDM7, CDT1, CHMP1, BANP, BNO224, BNO36, BNO34, BNO208, BNO230 and BNO44.
 23. A method of treating or inhibiting breast cancer in a patient in need of such treatment comprising the steps of administering to said patient a vector capable of expressing a gene selected from the group consisting of PRDM7, CDT1, CHMP1, BANP, BNO224, BNO36, BNO34, BNO208, BNO230 and BNO44.
 24. A method of treating or inhibiting cancer in a patient in need of such treatment, said method comprising administering to said patient a compound which increases expression or activity of a gene selected from the group consisting of PRDM7, CDT1, CHMP1, BANP, BNO224, BNO36, BNO34, BNO208, BNO230 and BNO44.
 25. Use of a nucleotide molecule having the nucleotide sequence set forth in SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:11, SEQ ID NO:13, SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:21, SEQ ID NO:22 and SEQ ID NO:25 in the treatment of cancer through gene therapy.
 26. Use of a polypeptide having the amino acid sequence set forth in SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24 and SEQ ID NO:26 in the treatment of cancer.
 27. Use of a polypeptide as claimed in claim 11 in the treatment of cancer.
 28. Use of an antibody as claimed in claim 12 in the treatment of cancer.
 29. Use of an antibody as claimed in claim 13 in the treatment of cancer.
 30. Use as claimed in any one of claims 23 to 29 wherein the cancer is breast cancer.
 31. A genetically modified non-human animal in which a gene selected from the group consisting of PRDM7, CDT1, CHMP1, BANP, BNO224, BNO36, BNO34, BNO208, BNO230 and BNO44 has been inactivated by knockout deletion.
 32. A genetically modified non-human animal as claimed in claim 31 wherein the animal is selected from the group consisting of rats, mice, hamsters, guineas pigs, rabbits, dogs, cats, goats, sheep, pigs and non-human primates such as monkeys and chimpanzees.
 33. A method of screening for candidate drugs which restore tumour suppressor activity, comprising the steps of: (1) contacting a cell in which expression of a gene selected from the group consisting of PRDM7, CDT1, CHMP1, BANP, BNO224, BNO36, BNO34, BNO208, BNO230 and BNO44 is compromised, with a candidate drug; and (2) assaying for the level of tumour suppressor activity in the cell.
 34. The use of a drug identified by the method of claim 33 in the treatment of cancer.
 35. Use of a drug as claimed in claim 34 in the manufacture of a medicament for the treatment of cancer.
 36. Use as claimed in any one of claims 34 or 35 wherein the cancer is breast cancer.
 37. A compound which increases expression or activity of a gene selected from the group consisting of PRDM7, CDT1, CHMP1, BANP, BNO224, BNO36, BNO34, BNO208, BNO230 and BNO44.
 38. An anti-cancer drug when identified by the method of claim
 25. 39. A cell comprising an expression vector capable of expressing PRDM7, CDT1, CHMP1, BANP, BNO224, BNO36, BNO34, BNO208, BNO230 and BNO44.
 40. A cell transformed with a nucleotide molecule having the sequence set forth in SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:11, SEQ ID NO:13, SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:21, SEQ ID NO:22 and SEQ ID NO:25.
 41. A pharmaceutical composition comprising any one or more of the genes selected from the group consisting of PRDM7, CDT1, CHMP1, BANP, BNO224, BNO36, BNO34, BNO208, BNO230 and BNO44, active fragments thereof, their expression products and antibodies to their expression products, and an inert carrier.
 42. A pharmaceutical composition comprising any one or more of a polypeptide having the amino acid sequence set forth in SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24 and SEQ ID NO:26, a polypeptide as claimed in claim 22, an agonist of any such polypeptide, an antibody as claimed in claim 12 or 13, an expression vector according to claim 23, a compound according to claim 24, or a candidate drug identified by the method of claim 33, and an inert carrier. 